食品科学 ›› 2024, Vol. 45 ›› Issue (7): 0-0.

• 成分分析 •    

基于Fisher判别分析可分性信息融合的马铃薯VC含量高光谱检测方法

郭林鸽1,殷勇1,于慧春1,袁云霞2   

  1. 1. 河南科技大学食品与生物工程学院
    2. 河南科技大学
  • 收稿日期:2023-08-08 修回日期:2024-01-23 出版日期:2024-04-15 发布日期:2024-04-09
  • 通讯作者: 殷勇 E-mail:yinyong@haust.edu.cn
  • 基金资助:
    “十三五”国家重点研发计划重点专项

Hyperspectral detection method of potato vitamin C content based on Fisher discriminant analysis separability information fusion

1, 1,   

  • Received:2023-08-08 Revised:2024-01-23 Online:2024-04-15 Published:2024-04-09

摘要: 为了提高马铃薯维生素C(VC)含量检测结果的准确性和可靠性,提出了一种基于Fisher判别分析(Fisher Discriminant Analysis,FDA)可分性数据融合的检测模型输入变量构造方法。首先,利用高光谱成像技术采集了200个马铃薯的高光谱信息,通过对比6种预处理方法和原始数据的建模结果,确定了多元散射校正为光谱数据的预处理方法。其次,采用竞争性自适应重加权算法(competitive adaptive reweighted sampling,CARS)、连续投影算法(successive projections algorithm,SPA)及CARS-SPA组合算法等3种方法提取相应的特征波长,通过对比分析最终确定了34个有效特征波长。然后,将有效特征波长进行FDA可分性数据融合,根据融合的新变量对样本间差异性判别能力的大小进行筛选,确定构造检测模型的输入变量。最后,分别对FDA融合前后筛选的变量建立偏最小二乘模型和反向传播神经网络(Back Propagation Neural Network,BPNN)模型,并对检测结果进行对比分析。结果表明,将CARS算法提取的34个特征波长进行FDA融合,采用前3个融合变量作为构造检测模型的输入变量时,其所建BPNN模型的相关系数由0.9726提高到了0.9990,均方根误差也由0.7723降低到了0.1727,不仅极大地降低了数据分析维度,而且还提高了检测结果的准确性。因此,基于FDA可分性数据融合构造检测模型输入变量以提高马铃薯VC含量检测结果的准确性是适宜的。

关键词: 高光谱成像, Fisher判别分析, 马铃薯, VC含量检测, 模型

Abstract: In order to improve the accuracy and reliability of the prediction results of potato VC content, a method for constructing input variables of detection model based on Fisher discriminant analysis (FDA) separable data fusion was proposed. Firstly, hyperspectral information of 200 potato samples was collected by hyperspectral imaging technology; and by comparing the modeling results the spectral data before and after the preprocessing by six methods, the multiplicative scatter correction was determined as the optimal preprocessing method. Secondly, competitive adaptive reweighted sampling (CARS), successive projections algorithm (SPA) and CARS-SPA combination algorithm were used to extract the corresponding feature wavelengths, and 34 effective feature wavelengths were finally determined through comparative analysis. Thirdly, the effective feature wavelengths were fused by FDA to achieve the separability of the data, and the fused new variables was screened according to the discrimination capacity for different samples, so as to determine the input variables of the detection model to be constructed. Finally, the partial least squares model and BP neural network model were established for the variables screened before and after FDA fusion, respectively, and the detection results were compared and analyzed. The results show that the correlation coefficient of the BPNN model is increased from 0.9726 to 0.9990, and the root mean square error is also reduced from 0.7723 to 0.1727 when the first three fused variables of 34 feature wavelengths extracted by CARS algorithm after FDA fusion are used as the input variables for the detection model, which not only reduces data analysis dimension, but also improves its detection ability. Therefore, it is suitable and effective to construct input variables of detection model based on FDA separable data fusion to improve the accuracy of potato VC content detection results.

Key words: Hyperspectral imaging, Fisher discriminant analysis, Potato, VC content detection, Model

中图分类号: