Publication | Closed Access
A statistical approach to learning and generalization in layered neural networks
244
Citations
26
References
1990
Year
EngineeringMachine LearningNetwork AnalysisMixture Of ExpertData SciencePattern RecognitionLayered NetworksLayered Neural NetworksGeneral Statistical DescriptionSupervised LearningComputational Learning TheoryMachine Learning ModelNetwork EstimationStatistical ApproachKnowledge DiscoveryGibbs DistributionComputer ScienceNetwork ModelingStatistical Learning TheoryNetwork ScienceEntropyClassifier System
A general statistical description of the problem of learning from examples is presented. Learning in layered networks is posed as a search in the network parameter space for a network that minimizes an additive error function of a statistically independent examples. By imposing the equivalence of the minimum error and the maximum likelihood criteria for training the network, the Gibbs distribution on the ensemble of networks with a fixed architecture is derived. The probability of correct prediction of a novel example can be expressed using the ensemble, serving as a measure to the network's generalization ability. The entropy of the prediction distribution is shown to be a consistent measure of the network's performance. The proposed formalism is applied to the problems of selecting an optimal architecture and the prediction of learning curves.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1