Concepedia

TLDR

Preprocessing, especially normalization, is essential for robust analysis of complex biofluid spectra, with integral normalization being the current standard in 1H NMR metabonomics. This study demonstrates that integral normalization is suboptimal and introduces probabilistic quotient normalization as a superior alternative. PQN estimates a most probable dilution factor by analyzing the distribution of amplitude quotients between a test spectrum and a reference spectrum. PQN outperforms integral and vector length normalization, yielding more robust and accurate scaling even in samples with large metabolite variations, as shown by simulations, cyclosporin‑A urine studies, and over 4000 control animal spectra.

Abstract

For the analysis of the spectra of complex biofluids, preprocessing methods play a crucial role in rendering the subsequent data analyses more robust and accurate. Normalization is a preprocessing method, which accounts for different dilutions of samples by scaling the spectra to the same virtual overall concentration. In the field of 1H NMR metabonomics integral normalization, which scales spectra to the same total integral, is the de facto standard. In this work, it is shown that integral normalization is a suboptimal method for normalizing spectra from metabonomic studies. Especially strong metabonomic changes, evident as massive amounts of single metabolites in samples, significantly hamper the integral normalization resulting in incorrectly scaled spectra. The probabilistic quotient normalization is introduced in this work. This method is based on the calculation of a most probable dilution factor by looking at the distribution of the quotients of the amplitudes of a test spectrum by those of a reference spectrum. Simulated spectra, spectra of urine samples from a metabonomic study with cyclosporin-A as the active compound, and spectra of more than 4000 samples of control animals demonstrate that the probabilistic quotient normalization is by far more robust and more accurate than the widespread integral normalization and vector length normalization.

References

YearCitations

Page 1