Publication | Closed Access
Towards Faster Stochastic Gradient Search
93
Citations
6
References
1991
Year
Adaptive K-meansModel OptimizationEngineeringMachine LearningData ScienceStochastic OptimizationPattern RecognitionComputational Learning TheoryAlgorithmic LearningStochastic Gradient DescentLarge Scale OptimizationComputer ScienceLearning Rate ηDeep LearningAdaptive Optimization
Stochastic gradient descent is a general algorithm which includes LMS, on-line backpropagation, and adaptive k-means clustering as special cases. The standard choices of the learning rate η (both adaptive and fixed functions of time) often perform quite poorly. In contrast, our recently proposed class of search then converge learning rate schedules (Darken and Moody, 1990) display the theoretically optimal asymptotic convergence rate and a superior ability to escape from poor local minima. However, the user is responsible for setting a key parameter. We propose here a new methodology for creating the first completely automatic adaptive learning rates which achieve the optimal rate of convergence.
| Year | Citations | |
|---|---|---|
Page 1
Page 1