Microarray and RNA-sequencing (RNA-seq) techniques each produce gene expression data that can be expressed as a matrix that often contains missing values. Thus, a process of missing-value imputation that uses coherence information of the dataset is necessary. Existing imputation methods, such as iterative bicluster-based least squares (bi-iLS), use biclustering to estimate the missing values because genes are only similar under correlative experimental conditions. Also, they use the row average to obtain a temporary complete matrix, but the use of the row average is considered to be a flaw. The row average cannot reflect the real structure of the dataset because the row average only uses the information of an individual row. Therefore, we propose the use of Bayesian principal component analysis (BPCA) to obtain the temporary complete matrix instead of using the row average in bi-iLS. This alteration produces new missing values imputation method called iterative bicluster-based Bayesian principal component analysis and least squares (bi-BPCA-iLS). Several experiments have been conducted on two-dimension independent gene expression datasets, which are microarray (e.g., cell-cycle expression dataset of yeast saccharomyces cerevisiae) and RNA-seq (gene expression data from schizosaccharomyces pombe) datasets. In the case of the microarray dataset, our proposed bi-BPCA-iLS method showed a significant overall improvement in the normalized root mean square error (NRMSE) values of 10.6% from the local least squares (LLS) and 0.6% from the bi-iLS. In the case of the RNA-seq dataset, our proposed bi-BPCA-iLS method showed an overall improvement in the NRMSE values of 8.2% from the LLS and 3.1% from the bi-iLS. The additional computational time of bi-BPCA-iLS is not significant compared to bi-iLS.

Download full-text PDF

Source
http://dx.doi.org/10.3934/mbe.2022405DOI Listing

Publication Analysis

Top Keywords

row average
20
iterative bicluster-based
12
bayesian principal
12
principal component
12
component analysis
12
gene expression
12
missing values
12
bicluster-based bayesian
8
analysis squares
8
missing-value imputation
8

Similar Publications

Tooth replacement of the filter-feeding pterosaur Forfexopterus and its implications for ecological adaptation.

An Acad Bras Cienc

January 2025

Shandong University of Science and Technology, College of Earth Science and Engineering, 579, Qianwangang Road, Huangdao, Qingdao, Shandong Province, 266590, China.

A "comb-dentition", characterized by long, needle-like, and closely-spaced teeth, is found in the ctenochasmatid pterosaurs as an adaptation for filter-feeding. However, little is known about their tooth replacement pattern, hindering our understanding of the development of the filter-feeding apparatus of the clade. Here, we describe the tooth replacement of the pterosaur Forfexopterus from the Jehol Biota based on high-resolution X-ray Computed Tomography (CT) reconstruction.

View Article and Find Full Text PDF

Mapping the myomagnetic field of a straight and easily accessible muscle after electrical stimulation using triaxial optically pumped magnetometers (OPMs) to assess potential benefits for magnetomyography (MMG). Approach: Six triaxial OPMs were arranged in two rows with three sensors each along the abductor digiti minimi (ADM) muscle. The upper row of sensors was inclined by 45° with respect to the lower row and all sensors were aligned closely to the skin surface without direct contact.

View Article and Find Full Text PDF

Detecting and recovering a low-rank signal in a noisy data matrix is a fundamental task in data analysis. Typically, this task is addressed by inspecting and manipulating the spectrum of the observed data, e.g.

View Article and Find Full Text PDF

. This study aims to enhance positron emission tomography (PET) imaging systems by developing a continuous depth-of-interaction (DOI) measurement technique using a single-ended readout. Our primary focus is on reducing the number of readout channels in the scintillation detectors while maintaining accurate DOI estimations, using a high-pass filter-based signal multiplexing technique combined with artificial neural networks (ANNs).

View Article and Find Full Text PDF

The effect of planting density on producing quality seed tubers using shoot tip cuttings and conventional methods from tubers has not been studied in Ethiopia. An experiment was conducted to determine the effects of spacing on seed tuber yield and related traits of potato cultivars at Adet Agricultural Research Center in northwestern Ethiopia during the 2023 cropping season. The treatments consisted of two potato varieties (Belete and Gera) propagated by shoot tip cuttings at four inter-row spacings (30, 40, 50, and 60 cm) and intra-row spacing (15, 20, 25, and 30 cm).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!