Title: Efficacy of using a dynamic length representation vs. a fixed-length for neuroarchitecture search

Conference

Deep learning neuroarchitecture and hyperparameter search are important for finding the configuration that maximizes learned model accuracy. However, the number of layer types, their associated hyperparameters, and the myriad ways to connect layers pose a significant computational challenge in discovering ideal model configurations. Here, we assess two approaches to neuroarchitecture search for a LeNet-style neural network: a fixed-length approach, in which a preset number of possible layers can be toggled on or off via mutation, and a variable-length approach, in which layers can be freely added or removed via special mutation operators. We found that the variable-length implementation trained better models while discovering unusual layer configurations worth further exploration.
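
The difference between the two representations can be illustrated with a minimal Python sketch. This is not the authors' implementation; the layer vocabulary, the `LayerGene` class, the function names, and the mutation probabilities are all hypothetical and chosen only to show the idea. In the fixed-length case the genome always holds the same number of layer slots and mutation flips an on/off flag; in the variable-length case mutation inserts or deletes layers, so the genome itself grows or shrinks.

```python
import random
from dataclasses import dataclass

# Hypothetical layer vocabulary; the abstract does not list the layer types.
LAYER_TYPES = ("conv", "pool", "dense")


@dataclass
class LayerGene:
    kind: str
    enabled: bool = True  # only used by the fixed-length representation


def random_layer(rng):
    """Draw a new layer gene of a random type."""
    return LayerGene(kind=rng.choice(LAYER_TYPES))


def mutate_fixed(genome, rng, p_toggle=0.2):
    """Fixed-length: the number of slots never changes; each slot is
    toggled on or off with probability p_toggle (an assumed value)."""
    return [
        LayerGene(g.kind, (not g.enabled) if rng.random() < p_toggle else g.enabled)
        for g in genome
    ]


def mutate_variable(genome, rng, p_add=0.2, p_remove=0.2, max_layers=8):
    """Variable-length: layers are inserted or deleted, so the genome
    length changes. Probabilities and max_layers are assumed values."""
    child = list(genome)
    if rng.random() < p_add and len(child) < max_layers:
        child.insert(rng.randrange(len(child) + 1), random_layer(rng))
    if rng.random() < p_remove and len(child) > 1:  # keep at least one layer
        del child[rng.randrange(len(child))]
    return child


def active_layers(genome):
    """Layer types that would actually be built into the network."""
    return [g.kind for g in genome if g.enabled]


if __name__ == "__main__":
    rng = random.Random(0)
    parent = [LayerGene("conv"), LayerGene("pool"), LayerGene("dense")]
    print(active_layers(mutate_fixed(parent, rng)))
    print(active_layers(mutate_variable(parent, rng)))
```

In this sketch the fixed-length search space is bounded by the preset number of slots, whereas the variable-length genome is bounded only by the assumed `max_layers` cap.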

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
2429829
Resource Relation:
Conference: Genetic Algorithms and Evolutionary Computation Conference (GECCO) - Melbourne, Australia - 7/14/2024-7/18/2024
Country of Publication:
United States
Language:
English
