Publication | Open Access
Efficiency Evaluation of Character-level RNN Training Schedules
13
Citations
4
References
2016
Year
Training SystemEngineeringMachine LearningRecurrent Neural NetworkSpeech RecognitionData ScienceEfficiency EvaluationNeural Scaling LawPrediction ModellingLarge Ai ModelSequence ModellingPredictive AnalyticsComputer ScienceTraining BudgetForecastingDeep LearningPredictive LearningPrediction SchedulesPrediction ScheduleSpeech Processing
We present four training and prediction schedules from the same character-level recurrent neural network. The efficiency of these schedules is tested in terms of model effectiveness as a function of training time and amount of training data seen. We show that the choice of training and prediction schedule potentially has a considerable impact on the prediction effectiveness for a given training budget.
| Year | Citations | |
|---|---|---|
Page 1
Page 1