首页 | 本学科首页   官方微博 | 高级检索  
     

基于机器学习算法的奶牛疾病预测模型的研究
引用本文:李尚汝,宋佳美,张城瑞,孙雨坤,张永根. 基于机器学习算法的奶牛疾病预测模型的研究[J]. 中国畜牧兽医, 2022, 49(7): 2534-2546. DOI: 10.16431/j.cnki.1671-7236.2022.07.011
作者姓名:李尚汝  宋佳美  张城瑞  孙雨坤  张永根
作者单位:东北农业大学动物科学技术学院, 哈尔滨 150030
基金项目:国家现代农业产业技术体系
摘    要:【目的】评估建立奶牛疾病预测模型的6种机器学习(machine learning, ML)算法的性能及预测变量的重要性。【方法】选取2020年12月至2021年11月,共计944头泌乳牛的生产信息、行为信息作为预测因子,疾病信息作为输出变量,训练并验证模型。将日产奶量、反刍量、活动量、胎次和泌乳天数作为输入变量,利用ML算法建立奶牛疾病的预测模型,评估决策树(Decision Tree, DT) C5.0、CHAID算法、人工神经网络(Artificial Neural Network, ANN)、随机森林(Random Forests, RF)、贝叶斯网络(Bayesian Networks, BN)和逻辑回归(Logistic Regression, LR)6种ML算法的性能,评估预测变量的重要性,以及将胎次和泌乳天数纳入预测变量后模型性能的改善情况。采用敏感性和特异性评估模型性能,按照权重排序评估输入变量对模型预测的重要性。【结果】DT C5.0算法敏感性>85%,特异性>90%,为性能最佳的模型;RF总敏感性为56.8%,对各类牛预测的性能较稳定;ANN、BN、DT...

关 键 词:奶牛  机器学习  疾病预测
收稿时间:2021-11-20

Study on Dairy Cow Disease Prediction Model Based on Machine Learning Algorithm
LI Shangru,SONG Jiamei,ZHANG Chengrui,SUN Yukun,ZHANG Yonggen. Study on Dairy Cow Disease Prediction Model Based on Machine Learning Algorithm[J]. China Animal Husbandry & Veterinary Medicine, 2022, 49(7): 2534-2546. DOI: 10.16431/j.cnki.1671-7236.2022.07.011
Authors:LI Shangru  SONG Jiamei  ZHANG Chengrui  SUN Yukun  ZHANG Yonggen
Affiliation:College of Animal Science and Technology, Northeast Agricultural University, Harbin 150030, China
Abstract:【Objective】 This study was aimed to evaluate 6 kind of machine learning (ML) algorithms which were used to establish a dairy cow disease prediction model, and the importance of predictors. 【Method】 The production information,behavior information and disease information of a total of 944 lactating cows from December 2020 to November 2021 were selected as predictors to train and validated the models.Daily milk production,rumination,activity,parity,and lactation days were used as input variables,machine learning algorithms were used to establish a dairy cow disease prediction model,6 machine learning algorithms including Decision Tree (DT) C5.0,CHAID algorithm,Artificial Neural Network (ANN),Random Forests (RF),Bayesian Networks (BN) and Logistic Regression (LR) were evaluated,the importance of predictors and the improvement of model performance by including parity and lactation days were assessed as predictors.Sensitivity and specificity were used to evaluate the performance of the models,and the importance of input variables for models predictions was evaluated according to the weight ranking.【Result】 The sensitivity of DT C5.0 algorithm was greater than 85%,and the specificity was greater than 90%,which was the model with the best performance.The total sensitivity of RF was 56.8%,and the prediction performance for various types of coe was relatively stable.ANN,BN and DT CHAID had better prediction performance for diseases with a large sample size,up to 74.4%.The correct identification rate of LR for sick cow was less than 40.0%,and most of them were identified as healthy cattle.The sum of daily milk production was the most important predictor of RF,ANN,and LR,and the number of days of lactation was the most important predictor of DT C5.0,CHAID and BN.After adding parity and lactation days,the sensitivity of the model's prediction was significantly improved.【Conclusion】 Using machine learning algorithms to predict dairy cow diseases has shown potential,and among them,DT C5.0 was a more suitable model.What's more,milk production and lactation days were relatively important variables in disease prediction models.In addition,including parity and lactation days as predictors could improve the accuracy of model prediction.
Keywords:dairy cow  machine learning  disease prediction  
点击此处可从《中国畜牧兽医》浏览原始摘要信息
点击此处可从《中国畜牧兽医》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号