Publication | Open Access
The SEMAINE Database: Annotated Multimodal Records of Emotionally Colored Conversations between a Person and a Limited Agent
Citations: 662
References: 32
Year: 2011
Voice Interaction, Speech Sciences, Affective Neuroscience, SEMAINE Database, Communication, Multimodal Sentiment Analysis, Psychology, Social Sciences, Speech Recognition, Automatic System, Affective Computing, Multimodal Interaction, Conversation Analysis, Automatic Recognition, Emotionally Colored Conversations, Content Analysis, Health Sciences, Multimodal Signal Processing, Limited Agent, Speech Communication, Solid SAL, Interpersonal Communication, Voice, SAL Agent, Social Computing, Speech Acoustics, Human-Computer Interaction, Speech Processing, Paralinguistics, Speech Perception, Emotion, Speech Interface, Emotion Recognition, Nonverbal Communication
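SEMAINE created a large audiovisual database of 959 conversations, each lasting approximately five minutes, recorded with 150 participants. High-resolution video and audio were captured while users interacted with Sensitive Artificial Listener (SAL) agents, either simulated by an operator or fully automatic, in several configurations. Solid SAL recordings are transcribed and annotated with affective dimensions; further material includes FACS annotation on selected extracts and measures of user engagement. The material is available through a web-accessible database.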
SEMAINE has created a large audiovisual database as part of an iterative approach to building Sensitive Artificial Listener (SAL) agents that can engage a person in a sustained, emotionally colored conversation. Data used to build the agents came from interactions between users and an "operator" simulating a SAL agent, in different configurations: Solid SAL (designed so that operators displayed appropriate nonverbal behavior) and Semi-automatic SAL (designed so that users' experience approximated interacting with a machine). We then recorded user interactions with the developed system, Automatic SAL, comparing the most communicatively competent version to versions with reduced nonverbal skills. High-quality recording was provided by five high-resolution, high-framerate cameras and four microphones, recorded synchronously. The recordings cover 150 participants and a total of 959 conversations with individual SAL characters, lasting approximately 5 minutes each. Solid SAL recordings are transcribed and extensively annotated: 6-8 raters per clip traced five affective dimensions and 27 associated categories. Other scenarios are labeled on the same pattern, but less fully. Additional information includes FACS annotation on selected extracts, identification of laughs, nods, and shakes, and measures of user engagement with the automatic system. The material is available through a web-accessible database.