Logistic Regression-HSMM-based Heart Sound Segmentation

TLDR

Heart sound segmentation, the precise localization of first and second heart sounds in a phonocardiogram, is crucial for automatic analysis, and while threshold‑based methods have limited success, probabilistic models such as hidden Markov and hidden semi‑Markov models—especially when incorporating expected state durations—have shown superior performance. The study aims to accurately segment first and second heart sounds in noisy real‑world PCG recordings by extending an HSMM with logistic‑regression‑based emission probabilities. The authors implemented a logistic‑regression‑based HSMM with a modified Viterbi decoder and evaluated it on 10,172 s of PCG from 112 patients, comprising 12,181 first and 11,627 second heart sounds. The method achieved an average F1 score of 95.63 ± 0.85%, outperforming the state‑of‑the‑art 86.28 ± 1.55% and achieving statistically significant improvement via logistic‑regression emission probabilities and an extended Viterbi algorithm.

Abstract

The identification of the exact positions of the first and second heart sounds within a phonocardiogram (PCG), or heart sound segmentation, is an essential step in the automatic analysis of heart sound recordings, allowing for the classification of pathological events. While threshold-based segmentation methods have shown modest success, probabilistic models, such as hidden Markov models, have recently been shown to surpass the capabilities of previous methods. Segmentation performance is further improved when a priori information about the expected duration of the states is incorporated into the model, such as in a hidden semi-Markov model (HSMM). This paper addresses the problem of the accurate segmentation of the first and second heart sound within noisy real-world PCG recordings using an HSMM, extended with the use of logistic regression for emission probability estimation. In addition, we implement a modified Viterbi algorithm for decoding the most likely sequence of states, and evaluated this method on a large dataset of 10,172 s of PCG recorded from 112 patients (including 12,181 first and 11,627 second heart sounds). The proposed method achieved an average F1 score of 95.63 ± 0.85%, while the current state of the art achieved 86.28 ± 1.55% when evaluated on unseen test recordings. The greater discrimination between states afforded using logistic regression as opposed to the previous Gaussian distribution-based emission probability estimation as well as the use of an extended Viterbi algorithm allows this method to significantly outperform the current state-of-the-art method based on a two-sided paired t-test.

References

Page 1

	Year	Citations

Page 1