Publication | Open Access
Multi-Task Deep Neural Networks for Natural Language Understanding
220
Citations
27
References
2019
Year
Llm Fine-tuningEngineeringMachine LearningCross-lingual RepresentationGlue TasksRepresentation LearningNatural Language ProcessingWord EmbeddingsSpeech RecognitionComputational LinguisticsMulti-task LearningLanguage StudiesMachine TranslationTen Nlu TasksLarge Ai ModelNlp TaskPre-trained ModelsGlue BenchmarkDeep LearningNatural Language UnderstandingLinguistics
MT‑DNN leverages large cross‑task data and benefits from a regularization effect that yields more general representations for new tasks and domains, extending Liu et al.’s model with a pre‑trained BERT transformer. The study presents MT‑DNN for learning representations across multiple NLU tasks and announces that its code and pretrained models will be publicly released.
In this paper, we present a Multi-Task Deep Neural Network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks. MT-DNN not only leverages large amounts of cross-task data, but also benefits from a regularization effect that leads to more general representations to help adapt to new tasks and domains. MT-DNN extends the model proposed in Liu et al. (2015) by incorporating a pre-trained bidirectional transformer language model, known as BERT (Devlin et al., 2018). MT-DNN obtains new state-of-the-art results on ten NLU tasks, including SNLI, SciTail, and eight out of nine GLUE tasks, pushing the GLUE benchmark to 82.7% (2.2% absolute improvement) as of February 25, 2019 on the latest GLUE test set. We also demonstrate using the SNLI and SciTail datasets that the representations learned by MT-DNN allow domain adaptation with substantially fewer in-domain labels than the pre-trained BERT representations. Our code and pre-trained models will be made publicly available.
| Year | Citations | |
|---|---|---|
Page 1
Page 1