Bayesian modelling and quantification of Raman spectroscopy

Moores, Matthew and Gracie, Kirsten and Carson, Jake and Faulds, Karen and Graham, Duncan and Girolami, Mark (2018) Bayesian modelling and quantification of Raman spectroscopy. Annals of Applied Statistics. ISSN 1932-6157 (In Press)

[thumbnail of Moores-etal-AAS-2018-Bayesian-modelling-and-quantification-of-Raman-spectroscopy]
Preview
Text. Filename: Moores_etal_AAS_2018_Bayesian_modelling_and_quantification_of_Raman_spectroscopy.pdf
Accepted Author Manuscript

Download (2MB)| Preview

Abstract

Raman spectroscopy can be used to identify molecules such as DNA by the characteristic scattering of light from a laser. It is sensitive at very low concentrations and can accurately quantify the amount of a given molecule in a sample. The presence of a large, nonuniform background presents a major challenge to analysis of these spectra. To overcome this challenge, we introduce a sequential Monte Carlo (SMC) algorithm to separate each observed spectrum into a series of peaks plus a smoothly-varying baseline, corrupted by additive white noise. The peaks are modelled as Lorentzian, Gaussian, or pseudo-Voigt functions, while the baseline is estimated using a penalised cubic spline. This latent continuous representation accounts for differences in resolution between measurements. The posterior distribution can be incrementally updated as more data becomes available, resulting in a scalable algorithm that is robust to local maxima. By incorporating this representation in a Bayesian hierarchical regression model, we can quantify the relationship between molecular concentration and peak intensity, thereby providing an improved estimate of the limit of detection, which is of major importance to analytical chemistry.