Publication | Closed Access
Two highly efficient second-order algorithms for training feedforward networks
162
Citations
26
References
2002
Year
Numerical AnalysisModel OptimizationEngineeringMachine LearningComputational Learning TheoryTraining TaskEfficient Second-order AlgorithmsStandard Lm IterationLarge Scale OptimizationComputer ScienceNonlinear OptimizationDeep LearningSignal ProcessingConvergence AnalysisAdaptive Optimization
We present two highly efficient second-order algorithms for the training of multilayer feedforward neural networks. The algorithms are based on iterations of the form employed in the Levenberg-Marquardt (LM) method for nonlinear least squares problems with the inclusion of an additional adaptive momentum term arising from the formulation of the training task as a constrained optimization problem. Their implementation requires minimal additional computations compared to a standard LM iteration. Simulations of large scale classical neural-network benchmarks are presented which reveal the power of the two methods to obtain solutions in difficult problems, whereas other standard second-order techniques (including LM) fail to converge.
| Year | Citations | |
|---|---|---|
Page 1
Page 1