Publication | Closed Access
Learning rate schedules for faster stochastic gradient search
216
Citations
6
References
2003
Year
Unknown Venue
Model OptimizationIncremental LearningEngineeringMachine LearningStochastic OptimizationStochastic Gradient DescentRate SchedulesLarge Scale OptimizationAdaptive MemoryNew MethodologyComputer ScienceAdaptive AlgorithmScenario GenerationOptimal RateAdaptive Optimization
The authors propose a new methodology for creating the first automatically adapting learning rates that achieve the optimal rate of convergence for stochastic gradient descent. Empirical tests agree with theoretical expectations that drift can be used to determine whether the crucial parameter c is large enough. Using this statistic, it will be possible to produce the first adaptive learning rates which converge at optimal speed.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1