Publication | Open Access
Bridging Nonlinearities and Stochastic Regularizers with Gaussian Error Linear Units
753
Citations
19
References
2016
Year
Stochastic RegularizerConvolutional Neural NetworkEngineeringMachine LearningAutoencodersGelu NonlinearitySocial SciencesData ScienceSparse Neural NetworkStochastic RegularizersRegularization (Mathematics)Approximation TheoryElu ActivationsInverse ProblemsComputer ScienceNonlinear Signal ProcessingDeep LearningStochastic Differential EquationComputational NeuroscienceGaussian ProcessStochastic CalculusNeuronal NetworkNeuroscienceBrain-like Computing
We propose the Gaussian Error Linear Unit (GELU), a high-performing neural network activation function. The GELU nonlinearity is the expected transformation of a stochastic regularizer which randomly applies the identity or zero map, combining the intuitions of dropout and zoneout while respecting neuron values. This connection suggests a new probabilistic understanding of nonlinearities. We perform an empirical evaluation of the GELU nonlinearity against the ReLU and ELU activations and find performance improvements across all tasks.
| Year | Citations | |
|---|---|---|
Page 1
Page 1