Publication | Open Access
Bat detective—Deep learning tools for bat acoustic signal detection
200
Citations
51
References
2018
Year
EngineeringMachine LearningData ScienceDeep LearningHealth SciencesBioacousticsDeep Learning AlgorithmsAudio AnalysisPassive Acoustic SensingSpeech ProcessingBat SpeciesAcoustic SensorUltrasoundAcoustic Signal ProcessingDistant Speech RecognitionBiosonarSpeech Recognition
Passive acoustic sensing is a powerful tool for assessing anthropogenic impacts on biodiversity, yet existing bat detection tools are largely commercial or focus only on species classification, leaving a gap in accurate, open‑source call localization, especially in noisy recordings. The authors aimed to create an open‑source convolutional neural network pipeline that detects ultrasonic, full‑spectrum search‑phase echolocation calls of bats. The pipeline was trained on full‑spectrum ultrasonic audio collected along European road transects and labeled by citizen scientists, and then applied to five years of monitoring data from Jersey, demonstrating its adaptability to other species with suitable training data. The CNN outperformed existing algorithms and commercial systems in detecting search‑phase calls, enabling automatic large‑scale bat population monitoring and establishing that deep learning can provide accurate, efficient audio surveillance.
Passive acoustic sensing has emerged as a powerful tool for quantifying anthropogenic impacts on biodiversity, especially for echolocating bat species. To better assess bat population trends there is a critical need for accurate, reliable, and open source tools that allow the detection and classification of bat calls in large collections of audio recordings. The majority of existing tools are commercial or have focused on the species classification task, neglecting the important problem of first localizing echolocation calls in audio which is particularly problematic in noisy recordings. We developed a convolutional neural network based open-source pipeline for detecting ultrasonic, full-spectrum, search-phase calls produced by echolocating bats. Our deep learning algorithms were trained on full-spectrum ultrasonic audio collected along road-transects across Europe and labelled by citizen scientists from www.batdetective.org. When compared to other existing algorithms and commercial systems, we show significantly higher detection performance of search-phase echolocation calls with our test sets. As an example application, we ran our detection pipeline on bat monitoring data collected over five years from Jersey (UK), and compared results to a widely-used commercial system. Our detection pipeline can be used for the automatic detection and monitoring of bat populations, and further facilitates their use as indicator species on a large scale. Our proposed pipeline makes only a small number of bat specific design decisions, and with appropriate training data it could be applied to detecting other species in audio. A crucial novelty of our work is showing that with careful, non-trivial, design and implementation considerations, state-of-the-art deep learning methods can be used for accurate and efficient monitoring in audio.
| Year | Citations | |
|---|---|---|
Page 1
Page 1