This article considers the regression problem with sparse Bayesian learning (SBL) when the number of weights P is larger than the data size N , i.e., P >> N . The situation induces overfitting and makes regression tasks, such as prediction and basis selection, challenging. We show a strategy to address this problem. Our strategy consists of two steps. The first is to apply an inverse gamma hyperprior with a shape parameter close to zero over the noise precision of automatic relevance determination (ARD) prior. This hyperprior is associated with the concept of a weakly informative prior in terms of enhancing sparsity. The model sparsity can be controlled by adjusting a scale parameter of inverse gamma hyperprior, leading to the prevention of overfitting. The second is to select an optimal scale parameter. We develop an extended predictive information criterion (EPIC) for optimal selection. We investigate the strategy through relevance vector machine (RVM) with a multiple-kernel scheme dealing with highly nonlinear data, including smooth and less smooth regions. This setting is one form of the regression task with SBL in the P >> N situation. As an empirical evaluation, regression analyses on four artificial datasets and eight real datasets are performed. We see that the overfitting is prevented, while predictive performance may be not drastically superior to comparative methods. Our methods allow us to select a small number of nonzero weights while keeping the model sparse. Thus, the methods are expected to be useful for basis and variable selection.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2021.3131357DOI Listing

Publication Analysis

Top Keywords

sparse bayesian
8
bayesian learning
8
weakly informative
8
extended predictive
8
predictive criterion
8
inverse gamma
8
gamma hyperprior
8
scale parameter
8
learning weakly
4
hyperprior
4

Similar Publications

Objective: Accurate preoperative evaluation of myometrial invasion (MI) is essential for treatment decisions in endometrial cancer (EC). However, the diagnostic accuracy of commonly utilized magnetic resonance imaging (MRI) techniques for this assessment exhibits considerable variability. This study aims to enhance preoperative discrimination of absence or presence of MI by developing and validating a multimodal deep learning radiomics (MDLR) model based on MRI.

View Article and Find Full Text PDF
Article Synopsis
  • This study introduces a new Bayesian method for identifying structural damage using a technique called Improved Elemental Modal Strain Energy Ratio (IEMSER) to create a more focused prior distribution.
  • The approach uses measured frequencies and mode shapes to develop the IEMSER indicator, which informs damage assessment and guides the Markov Chain Monte Carlo (MCMC) sampling to find accurate damage estimates.
  • Results from numerical tests on a steel truss bridge and modal data from an 18-story frame structure demonstrate that this method enhances the accuracy of damage identification by effectively utilizing prior information.
View Article and Find Full Text PDF

Estimation of genetic admixture proportions via haplotypes.

Comput Struct Biotechnol J

December 2024

Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA.

Estimation of ancestral admixture is essential for creating personal genealogies, studying human history, and conducting genome-wide association studies (GWAS). The following three primary methods exist for estimating admixture coefficients. The frequentist approach directly maximizes the binomial loglikelihood.

View Article and Find Full Text PDF

Robust source reconstruction of atmospheric radionuclides from observations of different sparsity with spatial preselection and non-smooth constraints.

J Hazard Mater

December 2024

Institute of Nuclear and New Energy Technology, Collaborative Innovation Centre of Advanced Nuclear Energy Technology, Key Laboratory of Advanced Reactor Engineering and Safety of Ministry of Education, Tsinghua University, Beijing 100084, China. Electronic address:

As the global-nuclear-capacity-tripling plan is implemented, reconstruction of the source locations and release rates of atmospheric radionuclides becomes increasingly important for the environment and human health. However, such reconstruction is vulnerable to unrealistic solutions because it is ill-posed. This study proposed a spatiotemporally constrained reconstruction method that excludes false estimates and achieves high accuracy.

View Article and Find Full Text PDF

Variational autoencoders (VAEs) employ Bayesian inference to interpret sensory inputs, mirroring processes that occur in primate vision across both ventral (Higgins et al., 2021) and dorsal (Vafaii et al., 2023) pathways.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!