Concepedia

Publication | Closed Access

Extreme Learning Machine for Multilayer Perceptron

1.4K

Citations

29

References

2015

Year

TLDR

Extreme learning machine (ELM) is a single‑hidden‑layer feedforward neural network with randomly initialized hidden nodes and analytically computed output weights, but its shallow architecture limits effective feature learning for natural signals such as images and videos. This work proposes an ELM‑based hierarchical learning framework for multilayer perceptrons to overcome these limitations. The framework performs unsupervised multilayer encoding via an ELM‑based sparse autoencoder with L1 regularization, then randomly projects the encoded outputs and trains each hidden layer forward without fine‑tuning, yielding compact, meaningful features and faster learning. Experiments on standard classification datasets and computer‑vision tasks show the method achieves faster convergence and higher accuracy than existing hierarchical learning approaches and outperforms deep learning in learning efficiency.

Abstract

Extreme learning machine (ELM) is an emerging learning algorithm for the generalized single hidden layer feedforward neural networks, of which the hidden node parameters are randomly generated and the output weights are analytically computed. However, due to its shallow architecture, feature learning using ELM may not be effective for natural signals (e.g., images/videos), even with a large number of hidden nodes. To address this issue, in this paper, a new ELM-based hierarchical learning framework is proposed for multilayer perceptron. The proposed architecture is divided into two main components: 1) self-taught feature extraction followed by supervised feature classification and 2) they are bridged by random initialized hidden weights. The novelties of this paper are as follows: 1) unsupervised multilayer encoding is conducted for feature extraction, and an ELM-based sparse autoencoder is developed via l1 constraint. By doing so, it achieves more compact and meaningful feature representations than the original ELM; 2) by exploiting the advantages of ELM random feature mapping, the hierarchically encoded outputs are randomly projected before final decision making, which leads to a better generalization with faster learning speed; and 3) unlike the greedy layerwise training of deep learning (DL), the hidden layers of the proposed framework are trained in a forward manner. Once the previous layer is established, the weights of the current layer are fixed without fine-tuning. Therefore, it has much better learning efficiency than the DL. Extensive experiments on various widely used classification data sets show that the proposed algorithm achieves better and faster convergence than the existing state-of-the-art hierarchical learning methods. Furthermore, multiple applications in computer vision further confirm the generality and capability of the proposed learning scheme.

References

YearCitations

Page 1