Publication | Open Access
Smart audio signal classification for tracking of construction tasks
19
Citations
20
References
2024
Year
MusicEngineeringMachine LearningSpeech RecognitionData SciencePattern RecognitionAudio AnalysisRobust Speech RecognitionAutomation In ConstructionHealth SciencesSignal ClassificationStructural Health MonitoringAudio RetrievalDeep LearningDistant Speech RecognitionSignal ProcessingDeep Neural NetworksAudio MiningCivil EngineeringSpeech ProcessingConstruction ManagementMfcc ValuesConstruction EngineeringMfccs-lstm Architectures
This paper presents a model for sound classification in construction that leverages a unique combination of Mel spectrograms and Mel-Frequency Cepstral Coefficient (MFCC) values. This model combines deep neural networks like Convolution Neural Networks (CNN) and Long short-term memory (LSTM) to create CNN-LSTM and MFCCs-LSTM architectures, enabling the extraction of spectral and temporal features from audio data. The audio data, generated from construction activities in a real-time closed environment is used to evaluate the proposed model and resulted in an overall Precision, Recall, and F1-score of 91%, 89%, and 91%, respectively. This performance surpasses other established models, including Deep Neural Networks (DNN), CNN, and Recurrent Neural Networks (RNN), as well as a combination of these models as CNN-DNN, CNN-RNN, and CNN-LSTM. These results underscore the potential of combining Mel spectrograms and MFCC values to provide a more informative representation of sound data, thereby enhancing sound classification in noisy environments.
| Year | Citations | |
|---|---|---|
Page 1
Page 1