Publication | Open Access
HiTrans: A Transformer-Based Context- and Speaker-Sensitive Model for Emotion Detection in Conversations
79
Citations
25
References
2020
Year
Unknown Venue
Utterance RepresentationsEngineeringHierarchical TransformersSpoken Language ProcessingSpoken Dialog SystemCommunicationMultimodal Sentiment AnalysisCorpus LinguisticsSocial SciencesEmotion DetectionSpeech RecognitionNatural Language ProcessingComputational LinguisticsAffective ComputingConversation AnalysisVoice RecognitionSpeaker-sensitive ModelMachine TranslationComputer ScienceDeep LearningSpeech CommunicationSpeech AnalysisMulti-speaker Speech RecognitionSpeech ProcessingParalinguisticsEmotionLinguisticsEmotion Recognition
Emotion detection in conversations (EDC) is to detect the emotion for each utterance in conversations that have multiple speakers. Different from the traditional non-conversational emotion detection, the model for EDC should be context-sensitive (e.g., understanding the whole conversation rather than one utterance) and speaker-sensitive (e.g., understanding which utterance belongs to which speaker). In this paper, we propose a transformer-based context- and speaker-sensitive model for EDC, namely HiTrans, which consists of two hierarchical transformers. We utilize BERT as the low-level transformer to generate local utterance representations, and feed them into another high-level transformer so that utterance representations could be sensitive to the global context of the conversation. Moreover, we exploit an auxiliary task to make our model speaker-sensitive, called pairwise utterance speaker verification (PUSV), which aims to classify whether two utterances belong to the same speaker. We evaluate our model on three benchmark datasets, namely EmoryNLP, MELD and IEMOCAP. Results show that our model outperforms previous state-of-the-art models.
| Year | Citations | |
|---|---|---|
Page 1
Page 1