We study an expansion of the log likelihood in undirected graphical models such as the restricted Boltzmann machine (RBM), where each term in the expansion is associated with a sample in a Gibbs chain alternating between two random variables (the visible vector and the hidden vector in RBMs). We are particularly interested in estimators of the gradient of the log likelihood obtained through this expansion. We show that its residual term converges to zero, justifying the use of a truncation--running only a short Gibbs chain, which is the main idea behind the contrastive divergence (CD) estimator of the log-likelihood gradient. By truncating even more, we obtain a stochastic reconstruction error, related through a mean-field approximation to the reconstruction error often used to train autoassociators and stacked autoassociators. The derivation is not specific to the particular parametric forms used in RBMs and requires only convergence of the Gibbs chain. We present theoretical and empirical evidence linking the number of Gibbs steps k and the magnitude of the RBM parameters to the bias in the CD estimator. These experiments also suggest that the sign of the CD estimator is correct most of the time, even when the bias is large, so that CD-k is a good descent direction even for small k.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1162/neco.2008.11-07-647 | DOI Listing |
Langmuir
January 2025
Tianjin Key Laboratory of Refrigeration Technology, Tianjin University of Commerce, Tianjin 300134, China.
Self-cleaning applications based on bionic surface designs requires an in-depth understanding of unique and complex wetting and evaporation processes of sessile droplets on natural biosurfaces. To this end, hydrophobic bamboo and Kalanchoe blossfeldiana leaves are excellent candidates for self-cleaning applications, but various properties, such as the heat and mass transfer processes during evaporation, remain unknown. Here, the dynamics of contact angle, radius, and heat and mass transfer during evaporation of sessile droplets on bamboo and Kalanchoe blossfeldiana leaves with roughness in the range 2.
View Article and Find Full Text PDFJ Neurochem
January 2025
The Laboratory of Molecular Gerontology, National Institute on Aging, National Institutes of Health, Bethesda, Maryland, USA.
Alzheimer disease is a neurodegenerative pathology-modifying mitochondrial metabolism with energy impairments where the effects of biological sex and DNA repair deficiencies are unclear. We investigated the therapeutic potential of dietary ketosis alone or with supplemental nicotinamide riboside (NR) on hippocampal intermediary metabolism and mitochondrial bioenergetics in older male and female wild-type (Wt) and 3xTgAD-DNA polymerase-β-deficient (3xTg/POLβ) (AD) mice. DNA polymerase-β is a key enzyme in DNA base excision repair (BER) of oxidative damage that may also contribute to mitochondrial DNA repair.
View Article and Find Full Text PDFSci Rep
January 2025
Department of Physical Chemistry, University of Tabriz, Tabriz, Iran.
J Chem Phys
January 2025
Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada.
In this work, we propose a path integral Monte Carlo approach based on discretized continuous degrees of freedom and rejection-free Gibbs sampling. The ground state properties of a chain of planar rotors with dipole-dipole interactions are used to illustrate the approach. Energetic and structural properties are computed and compared to exact diagonalization and numerical matrix multiplication for N ≤ 3 to assess the systematic Trotter factorization error convergence.
View Article and Find Full Text PDFAust N Z J Stat
September 2024
Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, 49931, USA.
Multivariate longitudinal ordinal and continuous data exist in many scientific fields. However, it is a rigorous task to jointly analyse them due to the complicated correlated structures of those mixed data and the lack of a multivariate distribution. The multivariate probit model, assuming there is a multivariate normal latent variable for each multivariate ordinal data, becomes a natural modeling choice for longitudinal ordinal data especially for jointly analysing with longitudinal continuous data.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!