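Construction of a visual intelligent identification model for Oncomelania hupensis robertsoni in Yunnan Province based on the EfficientNet-B4 model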

Chinese Journal of Schistosomiasis Control ›› 2024, Vol. 36 ›› Issue (6): 555-561.



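About the author: BAI Shaowen, male, master's student. Research interests: parasitic disease control and artificial intelligence.
Supported by:
National Natural Science Foundation of China (82173586, 82373644); Medical Research Project of Jiangsu Provincial Health Commission (x202302, M2021102); "Taihu Light" Science and Technology Research Project of Wuxi Science and Technology Bureau, Jiangsu Province (Y20212048)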

Construction of a visual intelligent identification model for Oncomelania hupensis robertsoni in Yunnan Province based on the EfficientNet-B4 model

BAI Shaowen1, 2, ZHOU Jihua3, DONG Yi3, ZHANG Jianfeng2, SHI Liang2*, YANG Kun1, 2*

1 School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China; 2 National Health Commission Key Laboratory of Parasitic Disease Prevention and Control, Jiangsu Provincial Key Laboratory on Parasite and Vector Control Technology, Jiangsu Institute of Parasitic Diseases, Wuxi, Jiangsu 214064, China; 3 Yunnan Institute of Endemic Disease Control and Prevention, Yunnan Key Laboratory of Natural Epidemic Disease Prevention and Control Technology, Dali, Yunnan 671000, China

Online: 2024-12-25 Published: 2024-12-31
Contact: YANG Kun, yangkun@jipd.com; SHI Liang, jipd1950sl@163.com

Abstract


Abstract: Objective　To construct a visual intelligent recognition model for Oncomelania hupensis robertsoni in Yunnan Province based on the EfficientNet-B4 model, and to evaluate the impact of data augmentation methods and model hyperparameters on snail recognition performance.

Methods　In June 2024, 400 O. hupensis robertsoni and 400 Tricula snails were collected from Yongsheng County, Yunnan Province. After identification and classification, 300 snails of each type were photographed, yielding 925 O. hupensis robertsoni images and 1,062 Tricula images; this dataset was divided into a training set and a validation set at a ratio of 8:2. A further 352 images of the remaining 100 O. hupensis robertsoni and 354 images of the remaining 100 Tricula snails served as an external test set. All images were preprocessed, including cropping and resizing. Three data augmentation approaches were used: baseline, Mixup and Gaussian blurring. The hyperparameters examined were two optimizers, adaptive moment estimation (Adam) and stochastic gradient descent (SGD); two loss functions, focal loss and cross entropy loss; and two learning rate decay strategies, cosine annealing and multi-step. Recognition models for O. hupensis robertsoni and Tricula snails were built on the EfficientNet-B4 model, and the data augmentation approaches and hyperparameters were combined into 7 training strategy groups. Model performance was tested on the external test set and evaluated by accuracy, precision, recall, F1 score, loss, Youden's index and the area under the receiver operating characteristic curve (AUC).

Results　Loss values were similar among models trained with different data augmentation approaches. Of these, the Group 4 model, which used both Mixup and Gaussian blurring, performed best, with an accuracy of 90.38%, precision of 90.07%, F1 score of 89.44%, Youden's index of 0.81 and AUC of 0.961 on the external test set. Accuracy was 29.16% lower with the SGD optimizer than with the Adam optimizer (χ² = 81.325, P < 0.001) and 0.80% lower with the cross entropy loss function than in the Group 4 model (χ² = 3.147, P > 0.05), whereas it was 0.65% higher with the multi-step learning rate decay strategy than in the Group 4 model (χ² = 0.208, P > 0.05). The model combining baseline + Mixup + Gaussian blurring data augmentation with the Adam optimizer, focal loss function and multi-step learning rate decay strategy showed the highest performance overall, with an accuracy of 91.03%, precision of 91.97%, recall of 88.11%, F1 score of 90.00%, Youden's index of 0.82 and AUC of 0.969 on the external test set.

Conclusions　The intelligent recognition model based on the EfficientNet-B4 model can accurately distinguish O. hupensis robertsoni from Tricula snails in Yunnan Province.
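
As a rough illustration of two training components named in the Methods, the sketch below blends a pair of samples with Mixup and evaluates a focal loss on the mixed label. It is a framework-free sketch and not the authors' implementation: the function names `mixup_pair` and `focal_loss`, the Beta parameter `alpha = 0.2` and the focusing parameter `gamma = 2.0` are assumptions, since the abstract does not report these settings.

```python
import math
import random


def mixup_pair(x_a, y_a, x_b, y_b, alpha=0.2, rng=random):
    """Blend two samples and their one-hot labels.

    The mixing weight lam is drawn from Beta(alpha, alpha);
    alpha = 0.2 is an assumed value, not taken from the paper.
    """
    lam = rng.betavariate(alpha, alpha)
    x = [lam * a + (1 - lam) * b for a, b in zip(x_a, x_b)]
    y = [lam * a + (1 - lam) * b for a, b in zip(y_a, y_b)]
    return x, y, lam


def focal_loss(probs, target, gamma=2.0):
    """Focal loss for a (possibly soft) target distribution.

    Computes -sum_c t_c * (1 - p_c)**gamma * log(p_c).
    gamma = 2.0 is an assumed value, not taken from the paper.
    """
    eps = 1e-12  # guards against log(0)
    return -sum(
        t * (1 - p) ** gamma * math.log(max(p, eps))
        for p, t in zip(probs, target)
    )


if __name__ == "__main__":
    rng = random.Random(0)
    # Two toy "images" (flattened to 2 pixels) with one-hot labels:
    # class 0 = Oncomelania, class 1 = Tricula (hypothetical encoding).
    x, y, lam = mixup_pair([0.0, 1.0], [1, 0], [1.0, 0.0], [0, 1], rng=rng)
    print("lambda:", lam, "mixed label:", y)
    print("focal loss:", focal_loss([0.8, 0.2], y))
```

With `gamma = 0` the focal loss reduces to ordinary cross entropy, so the two loss functions compared in the study can be viewed as members of the same family.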

Key words: Oncomelania hupensis, Tricula, Deep learning, Artificial intelligence, Computer vision, Yunnan Province
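
The threshold-based metrics listed in the abstract all follow from a 2 × 2 confusion matrix, and the sketch below computes them. The function names and the example counts are hypothetical (the abstract does not report a confusion matrix), and O. hupensis robertsoni is assumed to be the positive class. AUC is omitted because it needs per-image scores rather than counts. The final lines check that the F1 score reported for the best configuration agrees with its reported precision and recall.

```python
def classification_metrics(tp, fp, fn, tn):
    """Metrics from a 2x2 confusion matrix.

    Assumes every denominator is non-zero (each class is present
    and at least one positive prediction is made).
    """
    total = tp + fp + fn + tn
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)        # sensitivity
    specificity = tn / (tn + fp)
    return {
        "accuracy": (tp + tn) / total,
        "precision": precision,
        "recall": recall,
        "f1": 2 * precision * recall / (precision + recall),
        "youden_index": recall + specificity - 1,
    }


def f1_from(precision, recall):
    """F1 score as the harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(classification_metrics(tp=90, fp=10, fn=10, tn=90))

    # Consistency check on figures reported in the abstract:
    # precision 91.97% and recall 88.11% for the best configuration.
    print(round(100 * f1_from(0.9197, 0.8811), 2))
```

Up to rounding, the harmonic mean of the reported precision (91.97%) and recall (88.11%) equals the reported F1 score of 90.00%.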

CLC number: R383.24

BAI Shaowen, ZHOU Jihua, DONG Yi, ZHANG Jianfeng, SHI Liang, YANG Kun. Construction of a visual intelligent identification model for Oncomelania hupensis robertsoni in Yunnan Province based on the EfficientNet-B4 model[J]. Chinese Journal of Schistosomiasis Control, 2024, 36(6): 555-561.
