Publication | Open Access
An Empirical Analysis of Deep Network Loss Surfaces
22
Citations
17
References
2017
Year
Artificial IntelligenceGeometric LearningModel OptimizationDeep Neural NetworksEmpirical AnalysisMachine LearningData ScienceLoss FunctionsEngineeringSparse Neural NetworkMedical Image ComputingLoss FunctionLarge Scale OptimizationComputer ScienceDeep LearningNeural Architecture SearchNeural Scaling Law
The training of deep neural networks is a high-dimension optimization problem with respect to the loss function of a model. Unfortunately, these functions are of high dimension and non-convex and hence difficult to characterize. In this paper, we empirically investigate the geometry of the loss functions for state-of-the-art networks with multiple stochastic optimization methods. We do this through several experiments that are visualized on polygons to understand how and when these stochastic optimization methods find minima.
| Year | Citations | |
|---|---|---|
Page 1
Page 1