中国血吸虫病防治杂志 ›› 2024, Vol. 36 ›› Issue (6): 555-561.

• 论著 • 上一篇    下一篇

基于EfficientNet⁃B4模型的云南省湖北钉螺滇川亚种视觉智能识别模型的建立

白少文1, 2,周济华3,董毅3,张键锋2,施亮2*,杨坤1, 2*   

  1. 1 南京医科大学公共卫生学院(江苏 南京 211166);2 国家卫生健康委员会寄生虫病预防与控制技术重点实验室、江苏省寄生虫与媒介控制技术重点实验室、江苏省血吸虫病防治研究所(江苏 无锡 214064);3 云南省地方病防治所、云南省自然疫源性疾病防控重点实验室
  • 出版日期:2024-12-25 发布日期:2024-12-31
  • 作者简介:白少文,男,硕士研究生。研究方向:寄生虫病防治与人工智能
  • 基金资助:
    国家自然科学基金(82173586,82373644);江苏省卫生健康委员会医学科研项目(x202302,M2021102);江苏省无锡市科技局“太湖之光”科技攻关项目(Y20212048)

Construction of a visual intelligent identification model for Oncomelania hupensis robertsoni in Yunnan Province based on the EfficientNet⁃B4 model

BAI Shaowen1, 2, ZHOU Jihua3, DONG Yi3, ZHANG Jianfeng2, SHI Liang2*, YANG Kun1, 2*   

  1. 1 School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China; 2 National Health Commission Key Laboratory of Parasitic Disease Prevention and Control, Jiangsu Provincial Key Laboratory on Parasite and Vector Control Technology, Jiangsu Institute of Parasitic Diseases, Wuxi, Jiangsu 214064, China; 3 Yunnan Institute of Endemic Disease Control and Prevention, Yunnan Key Laboratory of Natural Epidemic Disease Prevention and Control Technology, Dali, Yunnan 671000, China
  • Online:2024-12-25 Published:2024-12-31
  • Contact: 杨坤yangkun@jipd.com;施亮jipd1950sl@163.com

摘要: 目的 建立基于EfficientNet⁃B4模型的云南省湖北钉螺滇川亚种视觉智能识别模型,并评估不同数据增强方法和模型超参数对钉螺识别效果的影响。方法 2024年6月,于云南省永胜县采集湖北钉螺和拟钉螺样本各400只,各选取300只,鉴别分类后进行图像样本采集。将采集的925张钉螺和1 062张拟钉螺图像作为数据集,按照8∶2的比例分为训练集和验证集;对剩余的100只钉螺和100只拟钉螺样本分别采集352张和354张图像作为外部测试集。对采集的图像进行裁剪、调整大小等预处理操作。采用基线(baseline)、Mixup和高斯模糊等3种数据增强方法;模型超参数包括自适应矩估计(adaptive moment estimation,Adam)和梯度下降法(stochastic gradient descent,SGD)2种优化器,焦点损失函数(focal loss)和交叉熵损失函数(cross entropy loss)2种损失函数以及余弦退火(cosine annealing)和多间隔调整(multi⁃step)2种学习率衰减策略。基于EfficientNet⁃B4模型建立对湖北钉螺滇川亚种和拟钉螺的智能识别模型,并将不同数据增强方法和不同超参数组合为7个不同训练策略组,采用外部测试集对模型性能进行测试。采用准确率、精确率、召回率、F1指数、损失值、约登指数和受试者工作特征(receiver operator characteristic,ROC)曲线下面积(area under curve,AUC)等指标评价不同训练策略下模型性能。结果 采用不同数据增强方法的各组模型间损失值差异较接近。同时采用Mixup和高斯模糊数据增强方法的第4组模型性能最佳,外部测试集测试准确率为90.38%、精确率为90.07%、F1指数为89.44%、约登指数为0.81、AUC为0.961。采用SGD优化器的组别模型准确率较采用Adam优化器的组别降低29.16%([χ2] = 81.325,P < 0.001),采用交叉熵损失函数模型准确率较第4组降低0.80%([χ2] = 3.147,P > 0.05),采用多间隔调整学习率衰减策略的模型准确率较第4组提高0.65%([χ2] = 0.208,P > 0.05)。采用基线+ Mixup +高斯模糊的数据增强策略与自适应矩估计+焦点损失函数+多间隔调整学习率衰减策略的超参数配置模型性能最高,外部测试集测试准确率为91.03%、精确率为91.97%、召回率为88.11%、F1指数为90.00%、约登指数为0.82、AUC为0.969。结论 基于EfficientNet⁃B4模型的云南省湖北钉螺滇川亚种智能识别模型可实现钉螺和拟钉螺的准确鉴别。

关键词: 湖北钉螺, 拟钉螺, 深度学习, 人工智能, 计算机视觉, 云南省

Abstract: Objective To construct a visual intelligent recognition model for Oncomelania hupensis robertsoni in Yunnan Province based on the EfficientNet⁃B4 model, and to evaluate the impact of data augmentation methods and model hyperparameters on the recognition of O. hupensis robertsoni. Methods A total of 400 O. hupensis robertsoni and 400 Tricula snails were collected from Yongsheng County, Yunnan Province in June 2024, and snail images were captured following identification and classification of 300 O. hupensis robertsoni and 300 Tricula snails. A total of 925 O. hupensis robertsoni images and 1 062 Tricula snail images were collected as a dataset and divided into a training set and validation set in a ratio of 8∶2, while 352 images captured from the other 100 O. hupensis robertsoni and 354 images from 100 Tricula snails served as an external test set. All acquired images were subjected to preprocessing, including cropping and resizing. Three data augmentation approaches were employed, including baseline, Mixup and Gaussian blurring, and model hyperparameters included two optimization algorithms of adaptive moment estimation (Adam) and stochastic gradient descent (SGD), two loss functions of focal loss and cross entropy loss, and two learning rate decay strategies of cosine annealing and multi⁃step. The intelligent recognition models of O. hupensis robertsoni and Tricula snails were constructed based on the EfficientNet⁃B4 model, and 7 training strategy groups were generated by combinations of different data augmentation approaches and hyperparameters. The performance of intelligent recognition models was tested with external test sets, and evaluated with accuracy, precision, recall, F1 score, loss, Youden's index, and the area under the receiver operating characteristic curve (AUC) under different training strategies. Results The variation of loss values was comparable among intelligent recognition models with different data augmentation approaches. The Group 4 model constructed with Mixup and Gaussian blurring data augmentation approaches showed the optimal performance, with an accuracy of 90.38%, precision of 90.07%, F1 score of 89.44%, Youden's index of 0.81 and AUC values of 0.961 in the external test set. The accuracy of models using the SGD optimizer reduced by 29.16% as compared to those using the Adam optimizer ([χ2] = 81.325, P < 0.001), and the accuracy of models using the cross entropy loss function reduced by 0.80% as compared to the Group 4 model ([χ2] = 3.147, P > 0.05), while the accuracy of models using the multi⁃step learning rate decay strategy increased by 0.65% as compared to the Group 4 model ([χ2] = 0.208, P > 0.05). In addition, the model with the baseline + Mixup + Gaussian blurring data augmentation approach and hyperparameters of Adam optimizer, focal loss function and multi⁃step learning rate decay strategy showed the highest performance, with an accuracy of 91.03%, precision of 91.97%, recall of 88.11%, F1 score of 90.00%, Youden's index of 0.82 and AUC values of 0.969 in external test set, respectively. Conclusions The intelligent recognition model of O. hupensis robertsoni based on EfficientNet⁃B4 model is accurate for identification of O. hupensis robertsoni and Tricula snails in Yunnan Province.

Key words: Oncomelania hupensis, Tricula, Deep learning, Artificial intelligence, Computer vision, Yunnan Province

中图分类号: