首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于太赫兹时域光谱技术的烟草组分识别
引用本文:周博,朱文魁,王赵改,蒋鹏飞,刘洪坤,李智慧,张柯,付丽丽.基于太赫兹时域光谱技术的烟草组分识别[J].农业工程学报,2022,38(10):310-316.
作者姓名:周博  朱文魁  王赵改  蒋鹏飞  刘洪坤  李智慧  张柯  付丽丽
作者单位:1. 中国烟草总公司郑州烟草研究院,郑州 450000;;2. 河南省农业科学院农副产品加工中心,郑州 450000
基金项目:国家烟草专卖局重点研发科技项目(110202102009);科技部对发展中国家常规性科技援助项目(KY202002007)
摘    要:为了准确识别不同烟草配方组分,利用太赫兹时域光谱技术,针对烟草工业常用的叶丝、梗丝和再造烟叶丝3种烟草配方组分开展太赫兹光谱特性分析和分类识别方法研究。对0.35~1.50 THz范围内3种烟丝的吸收系数谱和折射率谱进行分析,通过低方差滤波结合主成分分析(Principal Component Analysis,PCA)进行光谱特征提取和降维,分别建立针对吸收谱和折射谱的支持向量机(Support Vector Machine,SVM)分类模型、最邻近分类(K-Nearest Neighbor,KNN)模型和袋装树(Bagged trees)分类模型。结果表明,基于吸收系数谱的分类模型准确率最高,低方差滤波结合PCA的特征提取算法能显著提高分类效果,其中KNN模型准确率达到98.3%。对频域光谱使用连续投影算法(Successive Projections Algorithm,SPA)特征提取并结合SVM模型,分类准确率也在90%左右。研究表明太赫兹时域光谱技术可应用于不同烟草组分的分类判别,为太赫兹技术在烟草物料无损检测的应用提供参考。

关 键 词:光谱  模型  烟草组分  太赫兹时域光谱  分类识别  最近邻分类  低方差滤波
收稿时间:2021/12/15 0:00:00
修稿时间:2022/4/15 0:00:00

Identification of tobacco materials based on terahertz time-domain spectroscopy
Zhou Bo,Zhu Wenkui,Wang Zhaogai,Jiang Pengfei,Liu Hongkun,Li Zhihui,Zhang Ke,Fu Lili.Identification of tobacco materials based on terahertz time-domain spectroscopy[J].Transactions of the Chinese Society of Agricultural Engineering,2022,38(10):310-316.
Authors:Zhou Bo  Zhu Wenkui  Wang Zhaogai  Jiang Pengfei  Liu Hongkun  Li Zhihui  Zhang Ke  Fu Lili
Institution:1. Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450000, China;;2. Agricultural and Sideline Products Processing Research Center of Henan Academy of Agricultural Sciences, Zhengzhou 450000, China
Abstract:Abstract: Tobacco leaves can often be classified in detail by variety, stalk position, and place of production. Different tobacco components include leaf shreds, stem shreds, and reconstituted tobacco leaf shreds. It is a high demand to classify and identify the tobacco components in recent years. This present work aims to analyze the absorption coefficient spectrum and refractive index spectrum of three tobacco components in the range of 0.35-1.50 THz using the terahertz time-domain spectroscopy. The low-variance filtering combined with Principal Component Analysis (PCA) was performed for the spectral feature extraction and dimension reduction on spectroscopy data. Three classification models were developed to determine the specific absorption and refraction spectra of tobacco, including Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Bagged trees. The results show that a higher accuracy was achieved in the classification model using the absorption coefficient spectrum. The low variance filter combined with the PCA feature extraction significantly improved the classification accuracy, and the KNN model presented an accuracy rate of 98.3%. Furthermore, the Successive Projections Algorithm (SPA) feature extraction was also utilized for the frequency domain spectrum combined with the SVM model, where the classification accuracy was also about 90%. Consequently, the terahertz time-domain spectroscopy technology can be expected to serve the classification of cut tobacco. The finding can provide a strong reference for the application of terahertz time-domain spectroscopy to the non-destructive detection of tobacco materials.
Keywords:spectroscopy  models  tobacco components  terahertz time-domain spectroscopy  classification and recognition  K-nearest neighbor classification  low-variance filtering
点击此处可从《农业工程学报》浏览原始摘要信息
点击此处可从《农业工程学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号