首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于隐条件随机场的声调建模及区分性模型权重训练
作者姓名:黄浩  朱杰
作者单位:上海交通大学电子工程系,中国上海200240
基金项目:国家“十一五”科技支撑计划课题“养殖废水资源化与安全回灌关键技术研究”(2006badl7b02).
摘    要:为提高声调识别率,利用隐条件随机场对汉语声调进行建模,通过加入音节内动态特征、音节间动态特征以及段长特征来进一步提高声词识别性能。提出了将声调模型加入大词汇量连续语音识别系统的区分性方法,根据最小音子错误准则区分性训练模型相关的概率权重,对声学模型及声调模型概率进行加权;给出了两种权重组合策略并通过一种平滑方法来克服权重训练过拟合的问题。实验结果表明,基于隐条件随机场声调模型能够显著提高声词识别率以及大词汇量语音识别的识别率,同时与全局模型权重方法比较,区分性的模型权重训练能够在声调模型加入连续语音识别系统之后,进一步提高系统的识别性能。

关 键 词:语音识别  模型  隐条件随机场  最小音子错误
修稿时间:2007/7/5 0:00:00

TONE MODELING BASED ON HIDDEN CONDITIONAL RANDOM FIELDS AND DISCRIMINATIVE MODEL WEIGHT TRAINING
Authors:Huang Hao  Zhu Jie
Abstract:The use of hidden conditional random fields (HCRFs) for tone modeling is explored. The tone recognition performance is improved using HCRFs by taking advantage of intra-syllable dynamic, inter-syllable dynamic and duration features. When the tone model is integrated into continuous speech recognition, the discriminative model weight training (DMWT) is proposed. Acoustic and tone scores are scaled by model weights discriminatively trained by the minimum phone error (MPE) criterion. Two schemes of weight training are evaluated and a smoothing technique is used to make training robust to overtraining problem. Experiments show that the accuracies of tone recognition and large vocabulary continuous speech recognition (LVCSR) can be improved by the HCRFs based tone model. Compared with the global weight scheme, continuous speech recognition can be improved by the discriminative trained weight combinations.
Keywords:speech recognition  models  hidden conditional random fields  minimum phone error
点击此处可从《》浏览原始摘要信息
点击此处可从《》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号