食品科学 ›› 2020, Vol. 41 ›› Issue (8): 152-158.doi: 10.7506/spkx1002-6630-20190504-013

• 生物工程 • 上一篇    下一篇

出芽短梗霉发酵液中聚苹果酸定量近红外模型的建立与应用

张英昊,薛照阳,赵廷彬,殷海松,乔长晟,   

  1. (1.天津科技大学生物工程学院,天津 300457;2.天津慧智百川生物工程有限公司,天津 300457;3.天津现代职业技术学院生物工程学院,天津 300350;4.天津市食品绿色制造及安全校企协同创新实验室,天津 300457)
  • 出版日期:2020-04-25 发布日期:2020-04-20
  • 基金资助:
    滨海-中关村知行卓越创新创业实验室建设项目(17YFCZZC00310); 天津市食品绿色制造及安全校企协同创新实验室建设项目(17PTSYJC00080); 工业微生物优良菌种选育与发酵技术公共服务平台项目(17PTGCCX00190)

Construction and Application of a Predictive Model for Determination of Polymalic Acid in the Fermentation Broth of Aureobasidium pullulans by Near-Infrared Spectroscopy

ZHANG Yinghao, XUE Zhaoyang, ZHAO Tingbin, YIN Haisong, QIAO Changsheng,   

  1. (1. College of Biotechnology, Tianjin University of Science & Technology, Tianjin 300457, China; 2. Tianjin HuizhiBiotrans Biological Engineering Co. Ltd., Tianjin 300457, China;3. School of Bioengineering, Tianjin Modern Vocational Technology College, Tianjin 300350, China;4. Tianjin University-Enterprise Collaboration Innovation Laboratory of Food Green Manufacturing and Safety, Tianjin 300457, China)
  • Online:2020-04-25 Published:2020-04-20

摘要: 利用高效液相色谱法测量发酵液聚苹果酸浓度。联合使用间隔偏最小二乘法与移动窗口偏最小二乘法定位建模波段为5 638~6 024 cm-1。依次使用多元散射校正+标准正规化+Savitzky-Golay 55点平滑+一阶导数光谱的预处理方法结合前5维因子偏最小二乘回归建模,可以达到最佳的拟合精度,模型内部验证集预测均方误差(root mean square error of prediction,RMSEP)为1.553 g/L,Rp为0.970 0,外部验证集RMSEP为1.378 g/L,Rp为0.992 4,配对t检验在95%置信度下的最大偏差分别为1.48 g/L和0.83 g/L,模型对同一样品的3 次预测平行无显著差异。将模型应用于单因素培养基优化和诱变菌株样品的预测,结合配对t检验发现模型的预测误差较大,但是可以在样品间模型计算值偏差分别大于3.19 g/L和1.44 g/L的前提下可靠地比较聚苹果酸浓度的大小,从而验证近红外模型在培养基组分优化和诱变菌株筛选领域中的应用价值。

关键词: 近红外建模, 偏最小二乘法, 聚苹果酸, 单因素培养基优化, 诱变菌种筛选, 配对t-检验

Abstract: High performance liquid chromatography (HPLC) was used to measure the polymalic acid concentration in the fermentation broth of Aureobasidium pullulans. Interval partial least square regression (iPLSR) combined with moving window PLSR (MWPLSR) was used to confirm 5 638–6 024 cm-1 as the waveband for modeling. Multiplicative scatter correlation (MSC), standard normal variate (SNV), Savitzky-Golay 55 points smoothing and 1st derivative spectrum were successively operated as spectral pre-processing methods, and then PLSR with the first 5 factors was used to develop a predictive model with the highest accuracy. The root mean square of prediction (RMSEP) and correlation coefficient of prediction (Rp) of the model were 1.553 g/L and 0.970 0 for the internal test set, and 1.378 g/L and 0.992 4 for the external test set, respectively, and a paired t-test at 95% confidence level demonstrated that the maximum deviations between the HPLC values and the model predicted values were calculated as 1.48 and 0.83 g/L, respectively. There was no significant difference among three parallel predictions for one sample. When the model was applied to medium composition optimization and mutant screening, large prediction errors were found with paired t-test. However, the model could reliably predict polymalic acid concentration under the premise that the model calculated values were larger than 3.19 and 1.44 g/L, respectively, indicating the potential application of the near-infrared model in medium composition optimization and strain screening.

Key words: near-infrared modeling, partial least square, polymalic acid, medium optimization, strain screening, paired t-test

中图分类号: