Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Toward Improving the Generation Quality of Autoregressive Slot VAEs

Journal Article · · Neural Computation
DOI:https://doi.org/10.1162/neco_a_01635· OSTI ID:2377975

Unconditional scene inference and generation are challenging to learn jointly with a single compositional model. Despite encouraging progress on models that extract object-centric representations (“slots”) from images, unconditional generation of scenes from slots has received less attention. This is primarily because learning the multiobject relations necessary to imagine coherent scenes is difficult. We hypothesize that most existing slot-based models have a limited ability to learn object correlations. We propose two improvements that strengthen object correlation learning. The first is to condition the slots on a global, scene-level variable that captures higher-order correlations between slots. Second, we address the fundamental lack of a canonical order for objects in images by proposing to learn a consistent order to use for the autoregressive generation of scene objects. Specifically, we train an autoregressive slot prior to sequentially generate scene objects following a learned order. Ordered slot inference entails first estimating a randomly ordered set of slots using existing approaches for extracting slots from images, then aligning those slots to ordered slots generated autoregressively with the slot prior. Our experiments across three multiobject environments demonstrate clear gains in unconditional scene generation quality. Detailed ablation studies are also provided that validate the two proposed improvements.

Research Organization:
National Renewable Energy Laboratory (NREL), Golden, CO (United States)
Sponsoring Organization:
USDOE National Renewable Energy Laboratory (NREL), Laboratory Directed Research and Development (LDRD) Program
DOE Contract Number:
AC36-08GO28308
OSTI ID:
2377975
Report Number(s):
NREL/JA-2C00-90373; MainId:92151; UUID:d9381a8f-fe63-4927-a88f-8b99ff7af027; MainAdminId:72949
Journal Information:
Neural Computation, Journal Name: Neural Computation Journal Issue: 5 Vol. 36
Country of Publication:
United States
Language:
English

References (16)

Building machines that learn and think like people journal November 2016
Array programming with NumPy journal September 2020
Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense journal March 2020
The reviewing of object files: Object-specific integration of information journal April 1992
Vision as Bayesian inference: analysis by synthesis? journal July 2006
Infants' Physical World journal June 2004
Toward Causal Representation Learning journal May 2021
Matplotlib: A 2D Graphics Environment journal January 2007
Algorithms for the Assignment and Transportation Problems journal March 1957
Long Short-Term Memory journal November 1997
Simulation as an engine of physical scene understanding journal October 2013
Unsupervised Learning of Temporal Abstractions With Slot-Based Transformers journal March 2023
Comparing partitions journal December 1985
Core knowledge journal December 2006
Objective Criteria for the Evaluation of Clustering Methods journal December 1971
Set-to-Sequence Methods in Machine Learning: A Review journal August 2021

Similar Records

Non-autoregressive time-series methods for stable parametric reduced-order models
Journal Article · Tue Aug 25 00:00:00 EDT 2020 · Physics of Fluids · OSTI ID:1768748

Statistical aspects of autoregressive models in the assessment of radon mitigation
Technical Report · Sat Feb 28 23:00:00 EST 1987 · OSTI ID:6222182