食品科学

• 安全检测 • 上一篇    下一篇

红外光谱结合统计分析对不同产地玛咖的鉴别分类

王元忠,赵艳丽,张 霁,金 航   

  1. 云南省农业科学院药用植物研究所,云南 昆明 650200
  • 出版日期:2016-02-25 发布日期:2016-02-23
  • 通讯作者: 金 航
  • 基金资助:

    国家自然科学基金地区科学基金项目(31460538;81260608);云南省自然科学基金项目(2013FD066;2013FZ150)

Classification of Different Origins of Maca Based on Infrared Spectroscopy in Combination with Statistical Analysis

WANG Yuanzhong, ZHAO Yanli, ZHANG Ji, JIN Hang   

  1. Institute of Medicinal Plants, Yunnan Academy of Agricultural Sciences, Kunming 650200, China
  • Online:2016-02-25 Published:2016-02-23
  • Contact: JIN Hang

摘要:

采用傅里叶变换红外光谱法,对采自云南及秘鲁共139 份玛咖样品进行产地鉴别研究。采用多元散射校正结合二阶导数和Norris平滑预处理光谱,通过剔除噪声明显的光谱波段,筛选出适宜的主成分数为8。基于最优主成分数,采用间隔偏最小二乘(interval partial least-squares,iPLS)法对3 650.59~651.82 cm-1光谱进行优化分析。结果显示,筛选98 份样品在1 855.19~651.822、3 054.69~2 756.78 cm-1和3 650.59~3 353.6 cm-1光谱建立的间隔偏最小二乘判别分析(interval partial least-squares discriminant analysis,iPLS-DA)分类模型,其R2、校正均方根误差和预测均方根误差分别为0.958 4、0.785 8和1.164 2。通过41 份样品验证,验证正确率与原光谱建立的分类模型保持一致,均为87.80%。为进一步提高分类模型的精度,在iPLS筛选的光谱波段基础上,分别采用遗传算法(geneticalgorithm,GA)和蛙跳算法(shuffled frog leaping algorithm,SFLA)对光谱信息进行优化,结果显示,采用GA筛选频率大于4和5的光谱信息,筛选的光谱数据点分别为62 个和29 个;利用SFLA筛选概率大于0.1和0.15的光谱信息,筛选的光谱数据点分别为77 个和27 个。验证结果显示,采用GA-PLS-DA(62 个数据点)和GA-PLS-DA(29 个数据点)建立的PLS-DA分类模型识别正确率分别为95.12%和97.56%,采用SFLA-PLS-DA(77 个数据点)和SFLA-PLS-DA(27 个数据点)建立的分类模型识别正确率分别为92.68%和97.56%。对比上述方法可知,采用iPLS-DA、GA-PLS-DA和SFLA-PLSDA建立的分类模型均具有较好的预测性能,其中GA-PLS-DA(29 个数据点)和SFLA-PLS-DA(27 个数据点)建立分类模型能更准确地鉴别不同产地的玛咖。该方法的建立为玛咖红外光谱产地鉴别提供一种新的思路,所筛选的光谱变量可为不同产地玛咖内在化学成分(组分)差异性分析提供基础依据。

关键词: 玛咖, 红外光谱, 间隔偏最小二乘法, 遗传算法, 蛙跳算法

Abstract:

Based on Fourier transform infrared spectroscopy (FTIR), identification of the origin of 139 samples of maca
collected from Yunnan and Peru was conducted. The infrared spectra were preprocessed by multiple scattering correction
combined with second derivative and Norris smoothing. Through eliminating the noise spectral bands, the suitable number
of principal components was chose as eight. Based on the optimal number of principal components, by using interval partial
least squares (iPLS), the spectra in the range of 3 650.59–651.82 cm–1 was processed by optimization analysis. An iPLS-DA
classification model was built by screening the spectra of 98 samples in the ranges of 1 855.19–651.822, 3 054.69–2 756.78
and 3 650.59–3 353.6 cm–1. The R2, RMSEC and RMSEP of the model were 0.958 4, 0.785 8 and 1.164 2, respectively.
The verification with 41 samples indicated that the validation accuracy was consistent with that of the classification model
built using the original spectra, which was 87.80%. To further improve the accuracy of the classification model on the basis
of iPLS screening of spectral bands, the spectral information was optimized by genetic algorithm (GA) and shuffled frog
leaping algorithm (SFLA), respectively. The results showed that, through GA screening the frequency of spectral information
which was greater than 4 and 5, the filtered spectral data points were 62 and 29, respectively. Through SFLA screening the
probability of spectral information which was greater than 0.1 and 0.15, the filtered spectral data points were 77 and 27,
respectively. The validation results showed that the recognition efficiency of the classification model built by GA-PLS-DA
(62 data points) and GA-PLS-DA (29 data points) were 95.12% and 97.56%, respectively. The recognition efficiency of the
classification model built by SFLA-PLS-DA (77 data points) and SFLA-PLS-DA (27 data points) were 92.68% and 97.56%.
By comparing the above methods, we could find that the classification models built by iPLS-DA, GA-PLS-DA and SFLAPLS-
DA all had good prediction performance, of which the models built by GA-PLS-DA (29 data points) and SFLA-PLSDA
(27 data points) could more accurately identify the different origins of maca. The methods could provide a new way for
identification of the origin of maca with IR. The screening of the spectral variables could provide the basis for the difference
analysis of the chemical constitutes (components) in different origins of maca.

Key words: maca (Lepidium meyenii Walp.), infrared spectroscopy, interval partial least squares, genetic algorithm, shuffled frog leaping algorithm

中图分类号: