食品科学 ›› 2026, Vol. 47 ›› Issue (8): 384-395.doi: 10.7506/spkx1002-6630-20251013-060

• 安全检测 • 上一篇    下一篇

基于可见-近红外光谱的多策略特征融合新梅货架期判别分析

张慧,文龙,刘建江,李晓娟,邹湘军,焦海峰   

  1. (新疆大学智能制造现代产业学院,新疆 乌鲁木齐 830017)
  • 出版日期:2026-04-25 发布日期:2026-05-15
  • 基金资助:
    新疆自治区自然科学基金项目(2022D01C674)

Shelf-Life Classification of Xinjiang Plums Using Multi-strategy Feature Fusion Based on Visible-Near Infrared Spectroscopy

ZHANG Hui, WEN Long, LIU Jianjiang, LI Xiaojuan, ZOU Xiangjun, JIAO Haifeng   

  1. (College of Intelligent Manufacturing Modern Industry, Xinjiang University, ürümqi 830017, China)
  • Online:2026-04-25 Published:2026-05-15

摘要: 本研究利用可见-近红外(visible-near infrared spectroscopy,Vis-NIR)光谱技术采集新梅4 个不同货架期(采后1、3、5、7 d)的光谱信息,采用Savitzky-Golay(SG)卷积平滑、标准正态变量变换(standard normal variate transformation,SNV)、基线校正(baseline correction,BC)和最大-最小归一化(min-max normalization,MN)4 种预处理方法,并利用竞争性自适应重加权采样算法(competitive adaptive reweighted sampling,CARS)、变量空间迭代收缩法(variable iterative space shrinkage approach,VISSA)和连续投影算法(successive projections algorithm,SPA)分别提取特征波长,与波段比、波段差和归一化强度差(normalized spectral intensity difference,NSID)3 种光谱形态特征形成组合特征集,用于构建偏最小二乘判别分析(partial least squares-discriminant analysis,PLS-DA)与极限学习机(extreme learning machine,ELM)分类模型。结果表明,基于全光谱建模时,PLS-DA与ELM判别模型判别性能有限,验证集准确率均仅为70.24%;而采用MN预处理并融合SPA特征提取方法与NSID形态特征的MN-SPA-NSID-ELM模型,在仅使用24 个变量数与30 个隐含层神经元的条件下,验证集准确率达97.62%,显著优于其他模型组合。此外,在独立制备的新梅外部测试样本集上,该模型仍实现了99.18%的准确率,Kappa系数为0.989,有效地提升了新梅果品货架期分类判别效率,本研究可为新梅生产与分级环节提供快速、精准、无损的检测技术支持。

关键词: 新梅;可见-近红外光谱;货架期;多策略特征融合;机器学习

Abstract: In this study, visible-near infrared (Vis-NIR) spectra of Xinjiang plums at four different shelf-life stages (1, 3, 5, and 7 d after harvest) were collected. Four spectral preprocessing methods, Savitzky-Golay (SG) smoothing, standard normal variate transformation (SNV), baseline correction (BC), and min-max normalization (MN), were compared to develop shelf-life classification models using partial least squares-discriminant analysis (PLS-DA) or extreme learning machine (ELM). For the development of PLS-DA and ELM models, feature wavelength extraction methods, namely competitive adaptive reweighted sampling (CARS), variable iterative space shrinkage approach (VISSA), and successive projection algorithm (SPA), were separately combined with three spectral features, namely band ratio (BR), band difference (BD), and normalized spectral intensity difference (NSID), thereby forming combined feature sets. The results showed that the PLS-DA and ELM models based on the full spectrum exhibited limited classification performance with a validation accuracy of only 70.24%. In contrast, the MN-SPA-NSID-ELM model, which integrated MN preprocessing, SPA-based feature extraction, and NSID, achieved a validation accuracy of 97.62% using only 24 selected variables and 30 hidden layer neurons, significantly outperforming the other combined models. In addition, on an independent external test set of Xinjiang plums, this model still achieved an accuracy of 99.18% with a Kappa coefficient of 0.989, indicating improved shelf-life classification efficiency. This study provides a rapid, accurate, and nondestructive technique for the production and grading of Xinjiang plums.

Key words: Xinjiang plums; visible-near infrared spectroscopy; shelf-life; multi-strategy feature fusion; machine learning

中图分类号: