食品科学 ›› 2026, Vol. 47 ›› Issue (7): 345-352.doi: 10.7506/spkx1002-6630-20251028-217

• 安全检测 • 上一篇    下一篇

面向可见/近红外光谱在线分选的昭通冰糖心苹果多机器学习模型融合堆叠检测方法

祝翔,章晓玉,刘智,勒德祥,陈楠   

  1. (1.华东交通大学机电与车辆工程学院,江西 南昌 330000;2.华东交通大学 水果智能光电检测技术与装备国家地方联合工程研究中心,江西 南昌 330000)
  • 出版日期:2026-04-15 发布日期:2026-05-08
  • 基金资助:
    国家自然科学基金地区科学基金项目(32560602);“十四五”国家重点研发计划重点专项(2022YFD2001804); 江西省研究生创新专项(YC2025-S126)

Development of a Multi-machine Learning Model Fusion and Stacking Approach for Online Sorting of Zhaotong Sugar-Heart Apples Using Visible/Near-Infrared Spectroscopy

ZHU Xiang, ZHANG Xiaoyu, LIU Zhi, LE Dexiang, CHEN Nan   

  1. (1. School of Mechatronics and Vehicle Engineering, East China Jiaotong University, Nanchang 330000, China;2. National and Local Joint Engineering Research Center of Fruit Intelligent Photoelectric Detection Technology and Equipment, East China Jiaotong University, Nanchang 330000, China)
  • Online:2026-04-15 Published:2026-05-08

摘要: 针对昭通冰糖心苹果糖心发生的不确定性,以及不同糖心程度对果实风味与贮藏期影响不同的问题,本研究旨在实现有无糖心及不同程度糖心苹果的快速、无损精准分选,从而为果实商业价值提升及贮存分级提供技术支持。使用实验室自研的水果在线分选机,采集昭通冰糖心苹果的可见/近红外光谱信号。以变量投影重要性特征波长筛选方法得到的特征波长作为输入,建立随机森林、极限梯度提升、逻辑回归和支持向量机融合堆叠模型并与单一模型的性能进行对比。结果表明,与单一模型相比融合了4 种不同机器学习算法的堆叠模型表现出极佳的识别性能(准确率95.47%、召回率94.82%、特异度97.83%)。所提出的模型堆叠方法可在不大幅增加计算量的同时融合各基础模型优势,显著提升模型的预测性能,在水果在线分选应用中展现了广阔前景。

关键词: 冰糖心苹果;堆叠;变量投影重要性波段筛选;无损检测;可见/近红外光谱

Abstract: In view of the uncertainty of sugar heart occurrence in Zhaotong apples and the varying effects of different sugar heart degrees on fruit flavor and storage life, this study aimed to develop a rapid and non-destructive method for accurately sorting apples with or without different degrees of sugar heart, thereby providing technical support for enhancing the commercial value of fruits and optimizing their storage and grading. This study employed a laboratory-developed online fruit sorting system to acquire visible/near infrared (VIS/NIR) spectral signals of Zhaotong sugar-heart apples. Feature wavelengths identified by the variable importance in projection (VIP) method served as inputs for establishing a stacking ensemble model integrating random forest (RF), extreme gradient boosting (XGBoost), logistic regression (LR), and support vector machine (SVM) algorithms. The performance of the stacking model was compared against that of standalone models. The results showed that the stacking model integrating four distinct machine learning algorithms exhibited superior recognition performance with an accuracy of 95.47%, a true positive rate (TPR) of 94.82%, and a true negative rate (TNR) of 97.83% compared with standalone models. The proposed stacking ensemble approach significantly enhances predictive capability by combining the strengths of base models without substantially increasing the computational load, showing great potential for applications in online fruit sorting.

Key words: sugar-heart apples; stacking; waveband selection by variable importance in projection; non-destructive testing; visible/near-infrared spectroscopy

中图分类号: