Concepedia

Publication | Open Access

An Empirical Analysis of Deep Network Loss Surfaces

22

Citations

17

References

2017

Year

Abstract

The training of deep neural networks is a high-dimension optimization problem with respect to the loss function of a model. Unfortunately, these functions are of high dimension and non-convex and hence difficult to characterize. In this paper, we empirically investigate the geometry of the loss functions for state-of-the-art networks with multiple stochastic optimization methods. We do this through several experiments that are visualized on polygons to understand how and when these stochastic optimization methods find minima.

References

YearCitations

Page 1