Publication | Open Access
Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks
16
Citations
0
References
2021
Year
Artificial IntelligenceIncremental LearningEngineeringMachine LearningGeneral ApproachRecurrent Neural NetworkFirefly DescentEvolution StrategyData ScienceSparse Neural NetworkFirefly AlgorithmComputer ScienceNeural NetworksDeep LearningNeural Architecture SearchEvolving Neural NetworkAerospace EngineeringComputational NeuroscienceEvolutionary BiologyNeuronal Network
We propose firefly neural architecture descent, a general framework for progressively and dynamically growing neural networks to jointly optimize the networks' parameters and architectures. Our method works in a steepest descent fashion, which iteratively finds the best network within a functional neighborhood of the original network that includes a diverse set of candidate network structures. By using Taylor approximation, the optimal network structure in the neighborhood can be found with a greedy selection procedure. We show that firefly descent can flexibly grow networks both wider and deeper, and can be applied to learn accurate but resource-efficient neural architectures that avoid catastrophic forgetting in continual learning. Empirically, firefly descent achieves promising results on both neural architecture search and continual learning. In particular, on a challenging continual image classification task, it learns networks that are smaller in size but have higher average accuracy than those learned by the state-of-the-art methods.