Unconditional scene inference and generation are challenging to learn jointly with a single compositional model. Despite encouraging progress on models that extract object-centric representations ("slots") from images, unconditional generation of scenes from slots has received less attention. This is primarily because learning the multiobject relations necessary to imagine coherent scenes is difficult. We hypothesize that most existing slot-based models have a limited ability to learn object correlations. We propose two improvements that strengthen object correlation learning. The first is to condition the slots on a global, scene-level variable that captures higher-order correlations between slots. Second, we address the fundamental lack of a canonical order for objects in images by proposing to learn a consistent order to use for the autoregressive generation of scene objects. Specifically, we train an autoregressive slot prior to sequentially generate scene objects following a learned order. Ordered slot inference entails first estimating a randomly ordered set of slots using existing approaches for extracting slots from images, then aligning those slots to ordered slots generated autoregressively with the slot prior. Our experiments across three multiobject environments demonstrate clear gains in unconditional scene generation quality. Detailed ablation studies are also provided that validate the two proposed improvements.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1162/neco_a_01635 | DOI Listing |
Sci Rep
December 2024
College of Animal Science and Technology, Inner Mongolia Minzu University, Tongliao, 028000, China.
The aim of this experiment was to investigate the effects of rumen fluid and molasses on the nutrient composition, fermentation quality, and microflora of Caragana korshinskii Kom. The trial included four treatments: a control group (CK) without additives and experimental groups supplemented with 7% rumen fluid (R), 4% molasses (M), and 7% rumen fluid + 4% molasses (RM). 15 days and 60 days of ensiling.
View Article and Find Full Text PDFSci Rep
December 2024
School of Physical Education, Shanghai University of Sport, Shanghai, 200438, China.
Objective: This study aimed to examine the levels of physical activity (PA), sleep, and mental health (MH), specifically depression, anxiety, and stress, among Chinese university students. It also aimed to analyze the influencing factors of MH, providing a theoretical foundation for developing intervention programs to improve college students' mental health.
Methods: A stratified, clustered, and phased sampling method was employed.
Sci Rep
December 2024
Department of Electrical Engineering, College of Engineering, Qassim University, Buraidah, 52571, Saudi Arabia.
Unbalanced power systems cause transformers and generators to overheat, system losses to climb, and protective devices to trigger. An optimization-based control technique for distributed generators (DG) balances demand and improves power quality in three imbalanced distribution systems with 10, 13, and 37 nodes. Each system phase has its own DG.
View Article and Find Full Text PDFSci Rep
December 2024
Plum Island Animal Disease Center, Agricultural Research Service, USDA, Greenport, NY, 11944, USA.
For over a century African swine fever (ASF) has been causing outbreaks leading to devastating losses for the swine industry. The current pandemic of ASF has shown no signs of stopping and continues to spread causing outbreaks in additional countries. Currently control relies mostly on culling infected farms, and strict biosecurity procedures.
View Article and Find Full Text PDFSci Rep
December 2024
Institute of Soil and Water Conservation, Northwest A&F University, Yangling, 712100, Shaanxi, China.
Reservoir-operation optimisation is a crucial aspect of water-resource development and sustainable water process management. This study addresses bi-objective optimisation problems by proposing a novel crossover evolution operator, known as the hybrid simulated binary and improved arithmetic crossover (SBAX) operator, based on the simulated binary cross (SBX) and arithmetic crossover operators, and applies it to the Non-dominated Sorting Genetic Algorithms-II (NSGA-II) algorithm to improve the algorithm. In particular, the arithmetic crossover operator can obtain an optimal solution more precisely within the solution space, whereas the SBX operator can explore a broader range of potential high-quality solutions.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!