Factorized visual representations in the primate visual system and deep neural networks

Journal Article · eLife
DOI: https://doi.org/10.7554/elife.91685 · OSTI ID: 2471917
Object classification has been proposed as a principal objective of the primate ventral visual stream and has been used as an optimization target for deep neural network models (DNNs) of the visual system. However, visual brain areas represent many different types of information, and optimizing for classification of object identity alone does not constrain how other information may be encoded in visual representations. Information about different scene parameters may be discarded altogether (‘invariance’), represented in non-interfering subspaces of population activity (‘factorization’) or encoded in an entangled fashion. In this work, we provide evidence that factorization is a normative principle of biological visual representations. In the monkey ventral visual hierarchy, we found that factorization of object pose and background information from object identity increased in higher-level regions and strongly contributed to improving object identity decoding performance. We then conducted a large-scale analysis of factorization of individual scene parameters – lighting, background, camera viewpoint, and object pose – in a diverse library of DNN models of the visual system. Models which best matched neural, fMRI, and behavioral data from both monkeys and humans across 12 datasets tended to be those which factorized scene parameters most strongly. Notably, invariance to these parameters was not as consistently associated with matches to neural and behavioral data, suggesting that maintaining non-class information in factorized activity subspaces is often preferred to dropping it altogether. Thus, we propose that factorization of visual scene information is a widely used strategy in brains and DNN models thereof.
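The distinction between factorized and entangled encoding can be illustrated with a small numerical sketch. The code below is an assumption-laden toy, not the metric or code used in the article: the function names `subspace_basis` and `factorization_score`, the 90% variance threshold, and the synthetic five-unit "population" are all hypothetical choices made for illustration. It scores how much of the response variance driven by one scene parameter falls outside the principal subspace of the variance driven by another parameter, so a score near 1 means non-interfering subspaces and a score near 0 means full overlap.

```python
import numpy as np


def subspace_basis(responses, var_threshold=0.9):
    """Orthonormal basis (as columns) for the leading principal components of
    `responses` (stimuli x units) that together explain at least
    `var_threshold` of its variance. Threshold is an illustrative assumption."""
    centered = responses - responses.mean(axis=0, keepdims=True)
    # Rows of vt are the principal axes of the centered data.
    _, s, vt = np.linalg.svd(centered, full_matrices=False)
    frac = np.cumsum(s**2) / np.sum(s**2)
    k = min(int(np.searchsorted(frac, var_threshold)) + 1, len(s))
    return vt[:k].T


def factorization_score(resp_param, resp_other, var_threshold=0.9):
    """1 - (variance driven by one parameter that lies inside the subspace of
    variance driven by another parameter) / (total variance driven by the
    first parameter). Hypothetical toy metric, not the article's code."""
    basis = subspace_basis(resp_other, var_threshold)
    centered = resp_param - resp_param.mean(axis=0, keepdims=True)
    total = np.sum(centered**2)
    inside = np.sum((centered @ basis) ** 2)
    return 1.0 - inside / total


# Synthetic population of 5 units (purely illustrative data).
rng = np.random.default_rng(0)
n_stim, n_units = 200, 5

# Responses when only pose varies: variance confined to units 0 and 1.
pose = np.zeros((n_stim, n_units))
pose[:, :2] = rng.normal(size=(n_stim, 2))

# Factorized case: identity varies along units 2 and 3 (disjoint from pose).
identity_factorized = np.zeros((n_stim, n_units))
identity_factorized[:, 2:4] = rng.normal(size=(n_stim, 2))

# Entangled case: identity varies along the same units as pose.
identity_entangled = np.zeros((n_stim, n_units))
identity_entangled[:, :2] = rng.normal(size=(n_stim, 2))

score_factorized = factorization_score(identity_factorized, pose)
score_entangled = factorization_score(identity_entangled, pose)
print(score_factorized, score_entangled)
```

In this toy setup the factorized population scores close to 1 and the entangled population close to 0. Invariance, by contrast, would correspond to a parameter driving little or no response variance at all, which is a separate quantity from how the variance that does exist is arranged.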
Research Organization:
Krell Institute, Ames, IA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
SC0020347
OSTI ID:
2471917
Journal Information:
Journal Name: eLife; Vol. 13; ISSN 2050-084X
Publisher:
eLife Sciences Publications, Ltd.
Country of Publication:
United States
Language:
English
