食品科学 ›› 2020, Vol. 41 ›› Issue (4): 256-261.doi: 10.7506/spkx1002-6630-20190313-160

• 安全检测 • 上一篇    下一篇

正交试验设计优化近红外检测牛乳中蛋白质的建模条件

彭丹,刘亚丽,李林青,毕艳兰   

  1. (河南工业大学粮油食品学院,河南 郑州 450001)
  • 出版日期:2020-02-25 发布日期:2020-03-02
  • 基金资助:
    国家自然科学基金青年科学基金项目(31601537)

Optimization of Modeling Conditions for Near Infrared Measurement of Protein Content in Milk by Orthogonal Array Design

PENG Dan, LIU Yali, LI Linqing, BI Yanlan   

  1. (College of Food Science and Engineering, Henan University of Technology, Zhengzhou 450001, China)
  • Online:2020-02-25 Published:2020-03-02

摘要: 利用二维相关光谱技术对不同蛋白质含量的光谱数据进行分析,明确蛋白质吸收的灵敏光谱区域;分别考察检测波段、预处理方法和建模方法3 个因素对模型预测结果的影响,在此基础上通过正交试验设计优化乳蛋白近红外检测的建模条件,以避免各因素间的交互作用。结果表明,检测波段、预处理方法和建模方法均对蛋白质模型的预测结果有较大影响,经过分析可知因素主次顺序为建模方法>检测波段>预处理方法,其中标准正态变量变换和多元散射校正(multiplicative scatter correction,MSC)能够消除牛乳自身散射作用的干扰,线性的建模方法如主成分回归(principal component regression,PCR)、偏最小二乘法等明显优于非线性的支持向量机建模方法,优化后的建模条件为检测波段1 800~2 300 nm、预处理方法MSC、建模方法PCR,此时蛋白质模型的决定系数(R2)和预测均方根误差分别为0.993、0.106,为后期乳蛋白质含量快速检测设备的开发提供技术支持。

关键词: 牛乳, 近红外光谱, 正交试验设计, 蛋白质

Abstract: In order to improve the accuracy and stability of protein content measurement using near-infrared (NIR) spectroscopy, the spectral data of milk samples with different protein contents were analyzed using the two-dimensional correlation spectroscopy to identify the characteristic wavelengths region of protein. Then, the effects of wavelength bands, preprocessing algorithms and modeling methods on the prediction accuracy of the model were studied by single factor experiments. On this basis, the modeling conditions were optimized by orthogonal array design to avoid interactions. The results showed that all three factors had a great impact on the performance of the prediction model in the descending order of modeling methods, wavelength bands and preprocessing methods. Among the preprocessing algorithms tested, standard normal variable (SNV) algorithm and multiplicative scatter correction (MSC) algorithm could effectively eliminate the interference of scattering. The linear models such as principal component regression (PCR) and partial least squares (PLS) were significantly better than non-linear support vector machine regression (SVR). Finally, the optimized modeling conditions were determined as follows: detection wavelength range from 1 800 to 2 300 nm, MSC preprocessing, and PCR modeling. Under these conditions, the correlation coefficient (R2) and root mean square error of prediction (RMSEP) of the prediction model were 0.993 and 0.106, respectively. This research provides a feasible technical way to develop a new device for the rapid detection of protein content in milk in the future.

Key words: milk, near-infrared spectroscopy, orthogonal array design, protein

中图分类号: