Publication | Closed Access
Polyphonic music transcription by non-negative sparse coding of power spectra
143
Citations
12
References
2004
Year
MusicSpectral TheorySource SeparationEngineeringMachine LearningSpectrum EstimationPolyphonic Piano MusicSpeech RecognitionSpeech CodingData SciencePattern RecognitionAudio Signal ProcessingHealth SciencesAudio RetrievalComputer ScienceIndependent Spectral FeaturesSignal ProcessingDigital AudioPolyphonic TranscriptionCompressive SensingSpectral AnalysisSpeech ProcessingPolyphonic Music TranscriptionSignal Separation
We present a system for adaptive spectral basis decomposition that learns to identify independent spectral features given a sequence of short-term Fourier spectra. When applied to recordings of polyphonic piano music, the individual notes are identified as salient features, and hence each short-term spectrum is decomposed into a sum of note spectra; the resulting encoding can be used as a basis for polyphonic transcription. The system is based on a probabilistic model equivalent to a form of noisy independent component analysis (ICA) or sparse coding with non-negativity constraints. We introduce a novel modification to this model that recognises that a short-term Fourier spectrum can be thought of as a noisy realisation of the power spectral density of an underlying Gaussian process, where the noise is essentially multiplicative and non-Gaussian. Results are presented for an analysis of a live recording of polyphonic piano music.
| Year | Citations | |
|---|---|---|
Page 1
Page 1