基于集成学习优化的肉制品安全风险等级预警分析
CSTR:
作者:
作者单位:

作者简介:

穆书敏(1996-),女,硕士研究生,研究方向:大数据分析与处理,E-mail:shuminmu@whut.edu.cn 通讯作者:文红(1971-),女,硕士,教授级高级工程师,研究方向:食品安全风险评估与预警,E-mail:604962461@qq.com;共同通讯作者:桂预风(1963-),男,博士,教授,研究方向:应用统计与机器学习,E-mail:guiyufeng@whut.edu.cn

通讯作者:

中图分类号:

基金项目:

国家重点研发计划项目(2018YFC1603602)


Safety Risk Level Early Warning of Meat Products Based on Optimized Ensemble Learning
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    该研究依据2013~2017年肉制品抽检数据构造了5个安全风险等级,使用特征构造及独热编码进一步关联与肉制品安全相关的影响因素,构建极端梯度提升树算法(Extreme Gradient Boosting,XGBOOST)研究食品生产过程各类因素对于食品安全风险等级的影响程度,并使用多个指标评价模型。此外通过上采样解决样本不平衡问题、贝叶斯优化调节超参数,来提高模型性能及分类效果。相较于模型决策树(Decision Tree,DT)和随机森林(Random Forest,RF),XGBOOST模型在肉制品安全风险等级分类中的表现效果最佳。研究结果表明,食品生产过程环节错综复杂,使用one-hot encoding处理后的模型能够有效判断出各类因素对于食品安全风险等级的影响程度,集成模型中RF的学习效果比较稳定,XGBOOST经过参数调节后准确率等指标得到有效的提升且优于RF。不同采样下XGBOOST的平均精确率均能达到89.14%,平均F1值为88.59%,说明XGBOOST在肉制品安全风险等级预警中适用性,为日常抽检提供技术指导。

    Abstract:

    Five safety risk levels were established based on the detection data of meat products sampled between 2013 and 2017. Feature construction and one-hot encoding were used to further correlate factors relevant to meat product safety. An extreme gradient boosting (XGBOOST) model was established to study the influence levels of various factors during food production on the safety risk level; subsequently then multiple indices were used to evaluate the model. In addition, sample imbalance problem was solved by upsampling, and the hyperparameters were adjusted by Bayesian optimization to improve the model performance and classification results. Simultaneously, the model constructed was compared with the decision tree (DT) and random forest (RF) methods to evaluate their classification performance. The XGBOOST outperformed others in classifying the safety risk levels of meat products. Food production processes are complex, and this study shows that model processing with one-hot encoding could effectively identify the influence levels of various factors on food safety. Moreover, the result suggested that XGBOOST performs better in terms of total accuracy and other indices after parameter adjustment, compared to the RF model, while RF had the most stable learning performance. The average of accuracy and F1 score can reach 89.14% and 88.59%, respectively, under different sampling. The results suggest that XGBOOST can be applied to determine safety risk levels of meat products and provide technical support for daily supervision.

    参考文献
    相似文献
    引证文献
引用本文

穆书敏,陈锂,尹佳,郭鹏程,陈晨,董曼,赵锦,徐晴雪,文红*,桂预风*.基于集成学习优化的肉制品安全风险等级预警分析[J].现代食品科技,2023,39(8):273-286.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2022-07-13
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2023-09-08
  • 出版日期:
文章二维码