食品科学

• 分析检测 • 上一篇    下一篇

基于不同PLS算法的方竹笋中蛋白质分析的近红外光谱特征波段选择

黄 维,田丰玲,刘振尧,杨 琼,赵小辉,杨季冬   

  1. 1.西南大学化学化工学院,重庆 400715;2.长江师范学院化学化工学院,重庆 408100;
    3.重庆三峡学院化学与环境工程学院,重庆 404000
  • 出版日期:2013-11-25 发布日期:2013-12-05

Wavelength Selection for FT-NIR Spectroscopic Analysis of Protein in Chimonobambusa quadrangularis Shoot Based on iPLS and BiPLS Models

HUANG Wei,TIAN Feng-ling,LIU Zhen-yao,YANG Qiong,ZHAO Xiao-hui,YANG Ji-dong   

  1. 1. College of Chemistry and Chemical Engineering, Southwest University, Chongqing 400715, China;
    2. College of Chemistry and Chemical Engineering, Yangtze Normal University, Chongqing 408100, China;
    3. College of Chemistry and Environmental Engineering, Chongqing Three Gorges University, Chongqing 404000, China
  • Online:2013-11-25 Published:2013-12-05

摘要:

利用近红外光谱法对金佛山方竹笋的蛋白质分析,采用间隔偏最小二乘法(PLS)与反向间隔偏最小二乘法(BiPLS),实现蛋白质光谱特征波段选择。将全波段分划分为12与17个波段,对全波段和每个小波段分别用PLS回归建模,然后优化组合各个区间,建立BiPLS模型,用交互验证均方差(RMSECV)与预测均方差(RMSEP)对模型进行评价。结果表明:iPLS与 BiPLS的效果均优于基于全波段的PLS模型,尤以BiPLS模型效果最佳。当间隔数为12时,所选特征波段5、3、6、12、4、2、11建立的模型效果最佳,其交互验证均方差RMSECV与预测均方差RMSEP分别为0.321%、0.218%。该方法快速无损,有效地减少建模的变量数,使模型预测精度得到提高。

关键词: 近红外光谱, 蛋白质, 方竹笋, 间隔偏最小二乘法, 反向区间偏最小二乘法, 波段优选

Abstract:

Objective: To select the characteristic wavelength bands for non-destructive analysis of proteins in
Chimonobambusa quadrangularis shoot by FT-NIR spectroscopy combined with interval partial least squares (iPLS) and
backward interval partial least squares (BiPLS). Methods: The full wavelength range was divided into 12 or 17 bands. PLS
models were developed based on the full wavelength range or on each band and then BiPLS models were built by optimized
combinations of various intervals. The models were evaluated by the mean square error of cross validation (RMSECV) and
root mean square error of prediction (RMSEP). Results: The iPLS and BiPLS models were both more effective than the PLS
models based on the full wavelength range and the BiPLS model was the best one. When 12 intervals were adopted, the
combination of 7 ([5, 3, 6, 12, 4, 2 and 11]) spectral intervals provided the best model showing a 0.321% RMSECV and a
0.218% RMSEP. The proposed method allowed rapid and non-destructive determination of proteins with less variables used
in the modeling and thus improved prediction accuracy.

Key words: near infrared spectroscopy, protein, Chimonobambusa quadrangularis, interval partial least squares, backward interval partial least squares, wavelength selection