首页 | 本学科首页   官方微博 | 高级检索  
     

基于多注意力机制级联LSTM模型的猪脸表情识别
引用本文:温长吉, 张笑然, 吴建双, 杨策, 李卓识, 石磊, 于合龙. 基于多注意力机制级联LSTM模型的猪脸表情识别[J]. 农业工程学报, 2021, 37(12): 181-190. DOI: 10.11975/j.issn.1002-6819.2021.12.021
作者姓名:温长吉  张笑然  吴建双  杨策  李卓识  石磊  于合龙
作者单位:1.吉林农业大学信息技术学院,长春 130118;2.吉林农业大学智慧农业研究院,长春 130118;3.明尼苏达大学食品、农业与自然资源科学学院,圣保罗 55108;4.吉林大学工程仿生教育部重点实验室,长春 130022
基金项目:国家自然科学基金重点项目(U19A2061);国家重点研发技术专项(2017YFD0502001);国家自然科学基金面上项目(11372155,61472161);吉林省自然科学基金(20180101041JC);吉林省发展与改革委员会产业技术研究与开发项目(2021C044-8);吉林省教育厅科学研究项目(JJKH20190924KJ)。
摘    要:面部表情是传递情感的重要信息,是家畜生理、心理和行为的综合反映,可以用于评估家畜福利.由于家畜面部肌群结构简单,因此家畜面部不同区域的细微变化对于表情的反映较难识别.该研究提出一种基于多注意力机制级联LSTM框架模型(Multi-attention Cascaded Long Short Term Memory,MA-...

关 键 词:模型    表情识别  多注意力机制  多任务级联卷积网络  长短时记忆网络  家畜福利
收稿时间:2021-03-06
修稿时间:2021-05-29

Pig facial expression recognition using multi-attention cascaded LSTM model
Wen Changji, Zhang Xiaoran, Wu Jianshuang, Yang Ce, Li Zhuoshi, Shi Lei, Yu Helong. Pig facial expression recognition using multi-attention cascaded LSTM model[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2021, 37(12): 181-190. DOI: 10.11975/j.issn.1002-6819.2021.12.021
Authors:Wen Changji  Zhang Xiaoran  Wu Jianshuang  Yang Ce  Li Zhuoshi  Shi Lei  Yu Helong
Affiliation:1.College of Information and Technology, Jilin Agricultural University, Changchun 130118, China;2.Institute for the Smart Agriculture, Jilin Agricultural University, Changchun 130118, China;3.College of Food, Agricultural and Natural Resource Sciences, University of Minnesota, Paul, 55108, USA;4.Key Laboratory of Bionic Engineering (Ministry of Education), Jilin University, Changchun 130022, China
Abstract:Abstract: Facial expression recognition has widely been used in various life scenarios, such as medicine, criminology, education, and deep learning. Deep learning also makes this technology highly efficient and accurate at present. Much effort has been made to consider the migration of relatively mature facial recognition to animal expressions. The reason was that animals can also express their emotions through facial expressions, according to zoologists. Once the complex emotions expressed by animals can be understood, the incidence of injuries and illnesses can be early monitored in the freedom of animal expressions, thereby maintaining a happy mood for a long time, without hunger, thirst, and worries in a fully guaranteed life. As such, facial expressions can be expected to evaluate animal welfare, due mainly to a comprehensive reflection of physiology, psychology, and behavior of livestock. However, it is difficult to recognize the subtle changes in different areas of facial expressions, particularly for the simple tissue structure of facial muscles in domestic animals. In this study, a Multi-Attention cascaded Long Short Term Memory (MA-LSTM) model was proposed for the recognition of pig facial expression. The specific procedure was as follows: firstly, a simplified multi-task convolution neural network (SMTCNN) was used to detect and then locate the pig face in the frame image, where the influence of the non-pig face region on the recognition performance was removed. Secondly, a multi-attention mechanism was introduced to characterize various feature channels with different visual information and peak response regions. The facial salient regions caused by the changes of facial expression were captured via clustering the regions with similar peak responses. Then the facial salient regions were used to focus on subtle changes in the pig face. Finally, the convolution and attention features were fused and subsequently input into LSTM to classify the data. Data enhancement was performed on the original dataset, thereby obtaining a self-annotated expression dataset of domestic pigs. The expanded datasets were then utilized in the experiments. The experimental results showed that the recognition accuracy of the module with closing the multi-attention mechanism increased by 6.3 percentage points on average, while the misclassification rate was also reduced significantly, compared with the MA-LSTM model. Additionally, the average recognition accuracy of the MA-LSTM model increased by about 32.6, 18.0, 5.9, and 4.3 percentage points, respectively, compared with commonly-used facial video expression recognition. Four types of expressions were classified in visualization, such as anger, happiness, fear, and neutral. Specifically, there was a more obvious variation in the facial area of domestic pigs that was caused by anger and happiness, where the recognition accuracy was higher than others. Nevertheless, the misclassification rate was also higher, due mainly to the fact that the changes of two areas were relatively similar. In any way, the proposed MA-LSTM model was also verified by all the test data in pig face recognition.
Keywords:models   pig   facial expression recognition   multi-attention mechanism   multi-task cascaded convolutional network   long short-term memory network   animal welfare
本文献已被 万方数据 等数据库收录!
点击此处可从《农业工程学报》浏览原始摘要信息
点击此处可从《农业工程学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号