Publication | Closed Access
Fast-Rir: Fast Neural Diffuse Room Impulse Response Generator
33
Citations
28
References
2022
Year
EngineeringNeural RecodingRecurrent Neural NetworkAcoustic ModelingSpeech RecognitionRoom Impulse ResponsesAcoustic Signal ProcessingAcoustic EnvironmentAcoustic AnalysisHealth SciencesDiffuse ReflectionsComputer EngineeringComputer ScienceDistant Speech RecognitionSignal ProcessingSpeech CommunicationSpeech TechnologyComputational NeuroscienceMulti-speaker Speech RecognitionSpeech AcousticsSpeech ProcessingNeuroscienceAuditory ComputationBrain-like ComputingSpeech Perception
We present a neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment. Our FAST-RIR takes rectangular room dimensions, listener and speaker positions, and reverberation time (T <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">60</inf> ) as inputs and generates specular and diffuse reflections for a given acoustic environment. Our FAST-RIR is capable of generating RIRs for a given input T <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">60</inf> with an average error of 0.02s. We evaluate our generated RIRs in automatic speech recognition (ASR) applications using Google Speech API, Microsoft Speech API, and Kaldi tools. We show that our proposed FAST-RIR with batch size 1 is 400 times faster than a state-of-the-art diffuse acoustic simulator (DAS) on a CPU and gives similar performance to DAS in ASR experiments. Our FAST-RIR is 12 times faster than an existing GPU-based RIR generator (gpuRIR). We show that our FAST-RIR outperforms gpuRIR by 2.5% in an AMI far-field ASR benchmark.
| Year | Citations | |
|---|---|---|
Page 1
Page 1