Publication | Closed Access
The ACM Multimedia 2022 Computational Paralinguistics Challenge
26
Citations
15
References
2022
Year
Deepspectrum ToolkitEngineeringMachine LearningSpoken Language ProcessingSpeech RecognitionNatural Language ProcessingData SciencePattern RecognitionComputational LinguisticsRobust Speech RecognitionVoice RecognitionLanguage StudiesReal-time LanguageAudeep ToolkitSpeech SynthesisAcm Multimedia 2022Deep LearningSpeech CommunicationSpeech ProcessingSpeech InputDeep Feature ExtractionLinguistics
The ACM Multimedia 2022 Computational Paralinguistics Challenge addresses four different problems for the first time in a research competition under well-defined conditions: In the Vocalisations and Stuttering Sub-Challenges, a classification on human non-verbal vocalisations and speech has to be made; the Activity Sub-Challenge aims at beyond-audio human activity recognition from smartwatch sensor data; and in the Mosquitoes Sub-Challenge, mosquitoes need to be detected. We describe the Sub-Challenges, baseline feature extraction, and classifiers based on the 'usual' ComParE and BoAW features, the auDeep toolkit, and deep feature extraction from pre-trained CNNs using the DeepSpectrum toolkit; in addition, we add end-to-end sequential modelling, and a log-mel-128-BNN.
| Year | Citations | |
|---|---|---|
Page 1
Page 1