Publication | Closed Access
Why natural gradient?
156
Citations
2
References
2002
Year
Unknown Venue
Model OptimizationMachine VisionMachine LearningData ScienceEngineeringPattern RecognitionNatural Gradient AdaptationDerivative-free OptimizationLarge Scale OptimizationNatural GradientDeep LearningGradient AdaptationSupervised LearningNatural FunctionComputer VisionAdaptive Optimization
Gradient adaptation is a useful technique for adjusting a set of parameters to minimize a cost function. While often easy to implement, the convergence speed of gradient adaptation can be slow when the slope of the cost function varies widely for small changes in the parameters. In this paper, we outline an alternative technique, termed natural gradient adaptation, that overcomes the poor convergence properties of gradient adaptation in many cases. The natural gradient is based on differential geometry and employs knowledge of the Riemannian structure of the parameter space to adjust the gradient search direction. Unlike Newton's method, natural gradient adaptation does not assume a locally-quadratic cost function. Moreover, for maximum likelihood estimation tasks, natural gradient adaptation is asymptotically Fisher-efficient. A simple example illustrates the desirable properties of natural gradient adaptation.
| Year | Citations | |
|---|---|---|
Page 1
Page 1