Justifying and generalizing contrastive divergence.

Neural Comput

Department of Computer Science and Operations Research, University of Montreal, Montreal, Quebec, Canada.

Published: June 2009

We study an expansion of the log likelihood in undirected graphical models such as the restricted Boltzmann machine (RBM), where each term in the expansion is associated with a sample in a Gibbs chain alternating between two random variables (the visible vector and the hidden vector in RBMs). We are particularly interested in estimators of the gradient of the log likelihood obtained through this expansion. We show that its residual term converges to zero, justifying the use of a truncation--running only a short Gibbs chain, which is the main idea behind the contrastive divergence (CD) estimator of the log-likelihood gradient. By truncating even more, we obtain a stochastic reconstruction error, related through a mean-field approximation to the reconstruction error often used to train autoassociators and stacked autoassociators. The derivation is not specific to the particular parametric forms used in RBMs and requires only convergence of the Gibbs chain. We present theoretical and empirical evidence linking the number of Gibbs steps k and the magnitude of the RBM parameters to the bias in the CD estimator. These experiments also suggest that the sign of the CD estimator is correct most of the time, even when the bias is large, so that CD-k is a good descent direction even for small k.

Download full-text PDF

Source
http://dx.doi.org/10.1162/neco.2008.11-07-647DOI Listing

Publication Analysis

Top Keywords

gibbs chain
12
contrastive divergence
8
log likelihood
8
reconstruction error
8
justifying generalizing
4
generalizing contrastive
4
divergence study
4
study expansion
4
expansion log
4
likelihood undirected
4

Similar Publications

Self-cleaning applications based on bionic surface designs requires an in-depth understanding of unique and complex wetting and evaporation processes of sessile droplets on natural biosurfaces. To this end, hydrophobic bamboo and Kalanchoe blossfeldiana leaves are excellent candidates for self-cleaning applications, but various properties, such as the heat and mass transfer processes during evaporation, remain unknown. Here, the dynamics of contact angle, radius, and heat and mass transfer during evaporation of sessile droplets on bamboo and Kalanchoe blossfeldiana leaves with roughness in the range 2.

View Article and Find Full Text PDF

Alzheimer disease is a neurodegenerative pathology-modifying mitochondrial metabolism with energy impairments where the effects of biological sex and DNA repair deficiencies are unclear. We investigated the therapeutic potential of dietary ketosis alone or with supplemental nicotinamide riboside (NR) on hippocampal intermediary metabolism and mitochondrial bioenergetics in older male and female wild-type (Wt) and 3xTgAD-DNA polymerase-β-deficient (3xTg/POLβ) (AD) mice. DNA polymerase-β is a key enzyme in DNA base excision repair (BER) of oxidative damage that may also contribute to mitochondrial DNA repair.

View Article and Find Full Text PDF
Article Synopsis
  • The study examines how the surface properties and electrical conductivity of betaine-based ionic liquids change when mixed with different concentrations of gabapentin at a specific temperature.
  • The findings show that as gabapentin concentration and the length of the alkyl chain increase, surface tension decreases, indicating stronger interactions.
  • Additionally, micellization parameters improve with longer chains, while conductivity decreases at higher gabapentin concentrations due to increased viscosity and ion interactions.
View Article and Find Full Text PDF

In this work, we propose a path integral Monte Carlo approach based on discretized continuous degrees of freedom and rejection-free Gibbs sampling. The ground state properties of a chain of planar rotors with dipole-dipole interactions are used to illustrate the approach. Energetic and structural properties are computed and compared to exact diagonalization and numerical matrix multiplication for N ≤ 3 to assess the systematic Trotter factorization error convergence.

View Article and Find Full Text PDF

Bayesian analysis of multivariate mixed longitudinal ordinal and continuous data.

Aust N Z J Stat

September 2024

Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, 49931, USA.

Multivariate longitudinal ordinal and continuous data exist in many scientific fields. However, it is a rigorous task to jointly analyse them due to the complicated correlated structures of those mixed data and the lack of a multivariate distribution. The multivariate probit model, assuming there is a multivariate normal latent variable for each multivariate ordinal data, becomes a natural modeling choice for longitudinal ordinal data especially for jointly analysing with longitudinal continuous data.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!