食品科学 ›› 2023, Vol. 44 ›› Issue (12): 343-350.doi: 10.7506/spkx1002-6630-20220727-304

• 安全检测 • 上一篇    下一篇

结合太赫兹光谱与机器学习的小麦霉变程度判别

杨承霖,刘嘉祺,郭芸成,徐志远,张思怡,姚志凤,秦立峰,陈煦,何东健,卫亚红   

  1. (1.西北农林科技大学机械与电子工程学院,陕西 杨凌 712100;2.农业农村部农业物联网重点实验室,陕西 杨凌 712100;3.陕西省农业信息感知与智能服务重点实验室,陕西 杨凌 712100;4.西北农林科技大学生命科学学院,陕西 杨凌 712100)
  • 出版日期:2023-06-25 发布日期:2023-06-30
  • 基金资助:
    中国博士后科学基金面上项目(2020M673503);陕西省自然科学基础研究计划一般项目(青年)(2021JQ-145); 陕西省重点研发计划项目(2021NY-169;2020NY-101)

Detection of Mildew Degree of Wheat Using Terahertz Spectroscopy and Machine Learning

YANG Chenglin, LIU Jiaqi, GUO Yuncheng, XU Zhiyuan, ZHANG Siyi, YAO Zhifeng, QIN Lifeng, CHEN Xu, HE Dongjian, WEI Yahong   

  1. (1. College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling 712100, China; 2. Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture and Rural Affairs, Yangling 712100, China; 3. Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Service, Yangling 712100, China; 4. College of Life Sciences, Northwest A&F University, Yangling 712100, China)
  • Online:2023-06-25 Published:2023-06-30

摘要: 为快速、准确地判断小麦籽粒的霉变程度,研究基于太赫兹时域光谱技术,结合支持向量机(support vector machine,SVM)、随机森林(random forest,RF)和极限学习机(extreme learning machine,ELM)的霉变小麦定性分析方法。首先,将小麦籽粒分为正常、轻度霉变、中度霉变和重度霉变4 类,利用CCT-1800太赫兹时域光谱仪获取小麦样本在0.1~4.0 THz波段的光谱数据。对比采用不同光谱预处理方法对判别结果的影响后,使用主成分分析、线性判别分析(linear discriminant analysis,LDA)、t分布随机近邻嵌入3 种方法对光谱数据进行降维,结果表明LDA的降维效果最好。最后,构建基于SVM、RF和ELM的小麦霉变程度判别模型,结果显示SVM的判别效果最好,当核函数选择多项式核、误差惩罚系数为1时,判别准确率高达98.61%,预测集均方根误差值为0.142 9。本研究表明利用太赫兹光谱技术可实现小麦霉变程度的准确检测,为食品安全和粮食贮藏检测提供一种检测手段。

关键词: 太赫兹时域光谱;霉变小麦;光谱预处理;光谱降维;光谱分类

Abstract: In order to judge the mildew degree of wheat seeds quickly and accurately, this study proposed a qualitative analysis method for moldy wheat using terahertz time-domain (THz-TDS) spectroscopy combined with support vector (SVM), random forest (RF) or extreme learning machine (ELM). According to the content of aflatoxin B1 (AFB1), wheat seeds were classified into four types: normal, slight mildew, moderate mildew and severe mildew. Spectral data in the band of 0.1–4.0 THz were obtained using a CCT-1800 THz spectrometer. The effects of different spectral pretreatment methods on the results discrimination were examined, and three dimensionality reduction methods, principal component analysis (PCA), linear discriminant analysis (LDA), and t-distributed stochastic neighbor embedding (t-SNE), were used to reduce the dimension of spectral data. LDA was found to be the best method. Finally, a model based on SVM, RF or ELM was constructed. The SVM model had the best classification effect. When polynomial kernel function was chosen and error penalty coefficient was 1, the accuracy of discrimination was 98.61% and the root mean square error of prediction was 0.142 9. This study confirms that THz spectroscopy can be applied for accurate detection of wheat mildew, which can provide a detection approach for food safety and grain storage and detection.

Key words: terahertz time-domain spectroscopy; moldy wheat; spectral preprocessing; spectral dimension reduction; spectral classification

中图分类号: