首页 | 本学科首页   官方微博 | 高级检索  
     检索      

融合YOLOv7和BYTE多目标跟踪的多类别海珍品计数方法
引用本文:安志强,李智军,刘硕,赵永刚,陈启俊,左然涛,林远山.融合YOLOv7和BYTE多目标跟踪的多类别海珍品计数方法[J].农业工程学报,2023,39(9):183-189.
作者姓名:安志强  李智军  刘硕  赵永刚  陈启俊  左然涛  林远山
作者单位:大连海洋大学信息工程学院, 大连 116023;辽宁省海洋信息技术重点实验室, 大连 116023;设施渔业教育部重点实验室, 大连海洋大学, 大连 116023;大连鑫玉龙海洋生物种业科技股份有限公司, 大连 116007;大连海洋大学水产与生命学院, 大连 116023
基金项目:辽宁省教育厅基本科研项目(LJKZ0730,QL202016);辽宁省自然基金资助计划(2020-KF-12-09);辽宁省重点研发计划(2020JH2/10100043);设施渔业教育部重点实验室开放课题(202219);辽宁省应用基础计划项目(2022JH2/101300187);2023中央财政对辽宁渔业补助项目
摘    要:针对目前养殖过程中海珍品计数方法成本高、效率低、计数精度难以保障等问题,该研究以真实底播养殖环境下的海珍品为研究对象,以水下拍摄的海珍品视频为数据源,提出一种基于视频多目标跟踪的多类别海珍品计数方法。首先,采用性能优异的YOLOv7算法实现海珍品目标检测器,为多目标跟踪提供输入;然后,结合真实养殖环境下同类别海珍品外观相似性高、不清晰等特点,借鉴BYTE算法的多目标跟踪思想,设计多类别轨迹生成策略和基于轨迹ID号的计数策略,提出一种多类别海珍品跟踪与计数方法。并提出一套更适用于基于轨迹ID号计数方法的评估指标。试验结果表明,改进平均计数精度、改进平均绝对误差、改进均方根误差及帧率分别为91.62%、5.75、6.38和32帧/s,各项指标多优于YOLOv5+DeepSORT、YOLOv7+DeepSORT、YOLOv5+BYTE、YOLOv7+BYTE等算法,尤其改进平均计数精度、帧率指标比YOLOv5+DeepSORT高了29.51个百分点和8帧/s,且在改进平均绝对误差、改进均方根误差指标上分别降低19.50和12.08。该研究方法可有效帮助水产养殖企业掌握水下海珍品数量,为现代化...

关 键 词:机器视觉  深度学习  海珍品计数  水产养殖  多目标跟踪
收稿时间:2023/1/17 0:00:00
修稿时间:2023/3/7 0:00:00

Multi-category sea foods counting method integrating YOLOv7 and BYTE multi-target tracking
AN Zhiqiang,LI Zhijun,LIU Shuo,ZHAO Yonggang,CHEN Qijun,ZUO Rantao,LIN Yuanshan.Multi-category sea foods counting method integrating YOLOv7 and BYTE multi-target tracking[J].Transactions of the Chinese Society of Agricultural Engineering,2023,39(9):183-189.
Authors:AN Zhiqiang  LI Zhijun  LIU Shuo  ZHAO Yonggang  CHEN Qijun  ZUO Rantao  LIN Yuanshan
Institution:School of Information Engineering, Dalian Ocean University, Dalian 116023, China;Key Laboratory of Marine Information Technology of Liaoning Province, Dalian 116023, China;Key Laboratory of Environment Controlled Aquaculture (KLECA), Ministry of Education, Dalian Ocean University, Dalian 116023, China;Dalian Xinyulong Marine Biological Seed Industry Technology Co., Ltd, Dalian 116007, China;School of Fisheries and Life Sciences, Dalian Ocean University, Dalian 116023, China
Abstract:Accurate and rapid counting has been one of the most important steps on sea foods in modern aquaculture. Taking the sea foods in a real bottom-sowing aquaculture environment as the research object, this study aims to guarantee the cost-saving, highly efficient and easy counting on the multi-category sea foods using video multi-target tracking. The underwater videos of sea foods were taken as the data source. Firstly, the YOLOv7 algorithm with excellent performance was used to realize the target detector of sea treasure, providing the input for the multi-target tracking. Specifically, the image data was captured under real underwater using the "2017 Underwater Robot Picking Contest, URPC2017", and then used to train the YOLOv7 model. 17,655 images contained the holothurians, echinus, and scallops. The images were randomly divided into the training set of 14055 images in the ratio of 8:1:1, the verification set of 1800 images, and the test set of 1800 images. The training setting of the YOLOv7 detector was set as the image adaptive size of 640*640, the initial learning rate of 0.01, the momentum of 0.9, the weight decay of 0.0005, the batch size of 16, the number of training rounds of 300. One test was performed every 10 rounds of training round. The operating system was selected the Ubuntu18.04 for Haizhenpin detector training, the deep learning framework was PyTorch, the experimental processor was AMD Ryzen Threadripper 1920X 12 cores, and the graphics card was NVIDIA GeForce RTX2080. Secondly, the characteristics of high similarity were combined with the unclear appearance of the same type of sea foods in the real breeding environment. The multi-target tracking of the BYTE algorithm was used for the reference. A multi-category trajectory generation and a counting strategy were then designed for the sea treasure tracking using the trajectory ID number. Finally, an optimal combination of the indicators was also evaluated on the performance of the improved model. The test results show that the average counting precision, mean absolute error (MAE), root mean square error (RMSE), and frame rate were 91.62%, 5.75, 6.38, and 32 frames/s, respectively. All indicators were better than those of the current YOLOv5+DeepSORT, YOLOv7+DeepSORT, YOLOv5+BYTE, and YOLOv7+BYTE. Especially, the average counting accuracy and frame rate index ratio of YOLOv5+DeepSORT were improved by 29.51, and 28 percentage points than before. The MAE and RMSE of the improved model were reduced by 19.50 and 12.08, respectively. The quantity of underwater sea foods was effectively measured in the modern fishery. The finding can provide the technical reference for the production measurement and the scientific decision-making on the intelligent management of aquaculture. In addition, the underwater dataset was used to train the detection model under the same environment, in order to reduce the false detections caused by different conditions. Nevertheless, the underwater environment changes greatly, due to various influencing factors, such as lighting. Statistical counting was then performed to collect the video of marine treasures under the same environment every time during training. Therefore, the current model can be expected to improve on the different underwater environments in the future.
Keywords:machine vision  deep learning  sea foods counting  aquaculture  multi-target tracking
点击此处可从《农业工程学报》浏览原始摘要信息
点击此处可从《农业工程学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号