Toward Less Constrained Macro-Neural Architecture Search.

IEEE Trans Neural Netw Learn Syst

Published: October 2023

Networks found with neural architecture search (NAS) achieve state-of-the-art performance in a variety of tasks, outperforming human-designed networks. However, most NAS methods rely heavily on human-defined assumptions that constrain the search: the architecture's outer skeleton, the number of layers, parameter heuristics, and the search space. In addition, common search spaces consist of repeatable modules (cells) rather than entire architectures (macro-search), so the architecture search space is not fully explored. Imposing such constraints requires deep human expertise and restricts the search to predefined settings. In this article, we propose less constrained macro-neural architecture search (LCMNAS), a method that pushes NAS to less constrained search spaces by performing macro-search without relying on predefined heuristics or bounded search spaces. LCMNAS introduces three components for the NAS pipeline: 1) a method that leverages information about well-known architectures to autonomously generate complex search spaces based on weighted directed graphs (WDGs) with hidden properties; 2) an evolutionary search strategy that generates complete architectures from scratch; and 3) a mixed-performance estimation approach that combines information about architectures at the initialization stage with lower fidelity estimates to infer their trainability and capacity to model complex functions. We present experiments on 14 different datasets showing that LCMNAS is capable of generating both cell and macro-based architectures with minimal GPU computation and state-of-the-art results. Moreover, we conduct extensive studies on the importance of different NAS components in both cell and macro-based settings. The code for reproducibility is publicly available at https://github.com/VascoLopes/LCMNAS.
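To make the three components named in the abstract more concrete, the following is a minimal toy sketch in Python. It is not the authors' implementation and does not use the LCMNAS repository's API. The graph `WDG`, its edge weights, the helper names (`walk`, `mutate`, `mixed_fitness`, `evolve`), and both scoring proxies are hypothetical and chosen only for illustration. The sketch assumes that a search space can be represented as a weighted directed graph over layer types, that an architecture is a weighted random walk through that graph, and that fitness is a weighted mix of two cheap signals.

```python
import random

# Hypothetical toy WDG: nodes are layer types; an edge weight stands for how
# often one layer type follows another in reference networks (made-up numbers).
WDG = {
    "input": {"conv3x3": 3.0, "conv5x5": 1.0},
    "conv3x3": {"conv3x3": 2.0, "pool": 1.0, "output": 1.0},
    "conv5x5": {"conv3x3": 1.0, "pool": 1.0},
    "pool": {"conv3x3": 2.0, "output": 1.0},
}


def walk(wdg, rng, start="input", budget=12):
    """Weighted random walk from `start`; stops at 'output' or after `budget` layers."""
    layers, node = [], start
    while len(layers) < budget:
        successors = list(wdg[node])
        weights = [wdg[node][s] for s in successors]
        node = rng.choices(successors, weights=weights, k=1)[0]
        if node == "output":
            break
        layers.append(node)
    return layers


def mutate(arch, wdg, rng, budget=12):
    """Keep a non-empty prefix and resample the rest of the network from the graph."""
    cut = rng.randrange(1, len(arch) + 1)
    prefix = arch[:cut]
    return prefix + walk(wdg, rng, start=prefix[-1], budget=budget - cut)


def mixed_fitness(arch, alpha=0.5):
    """Placeholder mixed estimate; both terms are stand-ins, not the paper's estimators."""
    init_score = len(set(arch)) / len(arch)  # stand-in for an initialization-stage signal
    low_fidelity = arch.count("conv3x3") / len(arch)  # stand-in for a short-training score
    return alpha * init_score + (1 - alpha) * low_fidelity


def evolve(wdg, fitness, rng, pop_size=8, generations=10):
    """Simple elitist evolution: keep the best half, refill with mutated survivors."""
    population = [walk(wdg, rng) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        survivors = population[: pop_size // 2]
        children = [mutate(rng.choice(survivors), wdg, rng) for _ in survivors]
        population = survivors + children
    return max(population, key=fitness)
```

Calling `evolve(WDG, mixed_fitness, random.Random(0))` returns a list of layer names in which every consecutive pair is an edge of the toy graph. In a real setting the graph would be derived from existing architectures and the two proxies would be replaced by actual initialization-stage and low-fidelity training measurements, as the abstract describes.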

Source: http://dx.doi.org/10.1109/TNNLS.2023.3326648
