Publication | Open Access
ORCA-SPOT: An Automatic Killer Whale Sound Detection Toolkit Using Deep Learning
120
Citations
85
References
2019
Year
Large bioacoustic archives are valuable for uncovering communication patterns, but most contain few animal vocalizations amid extensive environmental noise, making manual extraction difficult, especially for socially complex species. The study aims to develop ORCA‑SPOT, a deep‑learning toolkit that automatically detects killer whale sounds in large acoustic datasets. By training neural networks on 11,509 killer‑whale calls and 34,848 noise samples, the authors applied ORCA‑SPOT to the 19,000‑hour Orchive, automatically segmenting 2.2 years of recordings in about eight days. ORCA‑SPOT achieved a 93.2 % precision (PPV) and 0.9523 AUC, enabling efficient annotation of large bioacoustic databases and offering adaptability to other species.
Abstract Large bioacoustic archives of wild animals are an important source to identify reappearing communication patterns, which can then be related to recurring behavioral patterns to advance the current understanding of intra-specific communication of non-human animals. A main challenge remains that most large-scale bioacoustic archives contain only a small percentage of animal vocalizations and a large amount of environmental noise, which makes it extremely difficult to manually retrieve sufficient vocalizations for further analysis – particularly important for species with advanced social systems and complex vocalizations. In this study deep neural networks were trained on 11,509 killer whale ( Orcinus orca ) signals and 34,848 noise segments. The resulting toolkit ORCA-SPOT was tested on a large-scale bioacoustic repository – the Orchive – comprising roughly 19,000 hours of killer whale underwater recordings. An automated segmentation of the entire Orchive recordings (about 2.2 years) took approximately 8 days. It achieved a time-based precision or positive-predictive-value (PPV) of 93.2% and an area-under-the-curve (AUC) of 0.9523. This approach enables an automated annotation procedure of large bioacoustics databases to extract killer whale sounds, which are essential for subsequent identification of significant communication patterns. The code will be publicly available in October 2019 to support the application of deep learning to bioaoucstic research. ORCA-SPOT can be adapted to other animal species.
| Year | Citations | |
|---|---|---|
Page 1
Page 1