食品科学 ›› 2024, Vol. 45 ›› Issue (1): 198-203.doi: 10.7506/spkx1002-6630-20230518-174

• 安全检测 • 上一篇    

三维荧光光谱融合小波包分解融合Fisher判别分析及支持向量机识别紫苏

任永杰,殷勇,于慧春,袁云霞   

  1. (河南科技大学食品与生物工程学院,河南 洛阳 471023)
  • 发布日期:2024-02-05
  • 基金资助:
    “十三五”国家重点研发计划重点专项(2017YFC1600802)

Identification of Perilla Based on Three-Dimensional Fluorescence Spectra Using Wavelet Packet Decomposition, Fisher Discriminant Analysis and Support Vector Machine

REN Yongjie, YIN Yong, YU Huichun, YUAN Yunxia   

  1. (College of Food and Bioengineering, Henan University of Science and Technology, Luoyang 471023, China)
  • Published:2024-02-05

摘要: 为实现紫苏品种的快速鉴别,避免以次充好,选取4 个品种的紫苏采集三维荧光数据,提出了一种基于小波包分解融合Fisher判别分析(Fisher discriminant analysis,FDA)的荧光数据特征选择策略,并实施了4 种紫苏的有效鉴别。首先,对三维荧光数据进行预处理,采用Delaunay三角形内插值法去除瑞利散射和拉曼散射,以消除它们的不利影响;运用Savitzky-Golar卷积平滑对数据进行平滑处理,以减少噪声的干扰。同时,对三维荧光数据进行初步筛选,去除了荧光强度小于0.01的发射波长。然后,对各激发波长对应的发射光谱进行3 层sym4小波包分解,计算得到最低频段的小波包能量值,作为各激发波长光谱数据表征量。接着,再利用FDA对小波包能量进行判别分析,将其所包含的差异性信息进行融合,得到FDA生成的新变量,并选取累计判别能力达到99%的前3 个FD变量作为不同品种差异性信息的表征变量,提出三维荧光数据的表征策略。最后,利用BP神经网络(back propagation neural network,BPNN)和支持向量机(support vector machine,SVM)两种模式识别算法对表征变量进行分析,得到FDA+BPNN和FDA+SVM两种鉴别结果。FDA+BPNN的训练集正确率为97.5%,测试集正确率为95%;FDA+SVM的训练集和测试集的正确率均达到98.33%。结果表明,三维荧光光谱技术结合小波包分解、FDA和SVM算法基本上能够实现紫苏品种的鉴别。这为后续有关紫苏的进一步检测研究(如某些有效成分的定量检测)提供了研究基础。

关键词: 紫苏;三维荧光;小波包分解;Fisher判别分析;BP神经网络;支持向量机

Abstract: In order to rapidly identify perilla species and avoid passing off, three-dimensional (3D) fluorescence spectral data of perilla from four regions in China were acquired. A feature selection strategy of fluorescence data based on wavelet packet decomposition fused with Fisher discriminant analysis (FDA) was proposed, and effective identification of the four species of perilla was implemented. First, the 3D fluorescence data were preprocessed by using Delaunay triangle interpolation to remove the adverse influence of Rayleigh scattering and Raman scattering; Savitzky-Golar (SG) convolutional smoothing was applied to smooth the data for the purpose of reducing the interference of noise. At the same time, the 3D fluorescence data were initially screened to remove emission wavelengths with fluorescence intensity less than 0.01. Second, the 3-layer sym4 wavelet packet decomposition of the emission spectrum corresponding to each excitation wavelength was performed, and the wavelet packet energy value of the lowest frequency band was calculated as the amount of spectral data characterization for each excitation wavelength. Third, FDA was used for discriminant analysis of these wavelet packet energy values, and the discrepancy information contained in them was fused to obtain the new variables generated by FDA; the first three FD variables with 99% cumulative discriminative power were selected as variables for the characterization of the discrepancy information of different species, and then a characterization strategy for the 3D fluorescence data was proposed. Finally, two pattern recognition algorithms, back propagation neural network (BPNN) and support vector machine (SVM), were used to analyze the characterization variables, and identification results were obtained with FDA + BPNN and FDA + SVM. A correct rate of 97.5% for the training set and 95% for the test set was observed with FDA + BPNN, and the correct rate obtained with FDA + SVM for both the training and test sets was 98.33%. These results showed that 3D fluorescence spectroscopy combined with wavelet packet decomposition, FDA and SVM algorithms could basically identify perilla from different regions, which will provide a basis for further research on perilla, such as quantitative detection of some active components.

Key words: perilla; three-dimensional fluorescence; wavelet packet decomposition; Fisher discriminant analysis; back propagation neural network; support vector machine

中图分类号: