
Smoothing Technique for Statistical Language Model Based on Global Discount
Authors: HUANG Yong-wen, HE Zhong-Shi
Abstract: Smoothing techniques are used mainly to address data sparsity in statistical language models. Existing smoothing techniques handle sparse data with different discounting and compensation strategies, and each has its own strengths and weaknesses in complexity and soundness. This paper presents a new smoothing technique for the bigram model based on global discounting. The model parameters, i.e. the bigram probabilities, are discounted according to bigram frequency, and unseen events are compensated using the lower-order model; the soundness of this scheme is supported by minimizing perplexity. Experimental results show that the technique outperforms the commonly used Katz smoothing technique.

Keywords: statistical language model; smoothing technique; global discount; perplexity
Received: 4/5/2005
Revised: 4/5/2005
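
The abstract describes a discount-and-compensate scheme for bigram probabilities but does not give the global discount formula itself. The following is therefore only a minimal sketch of the general idea, not the authors' method: it assumes a single constant discount subtracted from every seen bigram count, an add-one smoothed unigram model as the lower-level model, and a simple grid search that picks the discount with the lowest held-out perplexity. The class name `DiscountedBigram`, its parameters, and the toy corpus are all hypothetical.

```python
import math
from collections import Counter, defaultdict


class DiscountedBigram:
    """Bigram model with one constant discount and unigram back-off (illustrative only)."""

    def __init__(self, tokens, vocab, discount=0.5):
        # Assumption: every training token is in vocab and 0 < discount < 1.
        self.vocab = set(vocab)
        self.discount = discount
        self.unigrams = Counter(tokens)
        self.bigrams = Counter(zip(tokens, tokens[1:]))
        self.total = sum(self.unigrams.values())
        self.context_total = Counter()      # number of bigrams starting with each word
        self.followers = defaultdict(set)   # words seen after each context word
        for (prev, word), count in self.bigrams.items():
            self.context_total[prev] += count
            self.followers[prev].add(word)

    def p_unigram(self, word):
        # Lower-level model: add-one smoothing over the fixed vocabulary.
        return (self.unigrams[word] + 1) / (self.total + len(self.vocab))

    def p_bigram(self, prev, word):
        ctx = self.context_total[prev]
        if ctx == 0:
            # Context never seen as a bigram prefix: fall back to the unigram model.
            return self.p_unigram(word)
        count = self.bigrams[(prev, word)]
        if count > 0:
            # Seen bigram: discounted relative frequency.
            return (count - self.discount) / ctx
        # Unseen bigram: share the freed mass in proportion to unigram probability.
        seen = self.followers[prev]
        reserved = self.discount * len(seen) / ctx
        unseen_mass = 1.0 - sum(self.p_unigram(w) for w in seen)
        return reserved * self.p_unigram(word) / unseen_mass

    def perplexity(self, tokens):
        pairs = list(zip(tokens, tokens[1:]))
        log_sum = sum(math.log(self.p_bigram(p, w)) for p, w in pairs)
        return math.exp(-log_sum / len(pairs))


# Toy data, invented purely for illustration.
train_tokens = "the cat sat on the mat the dog sat on the log".split()
heldout = "the dog sat on the mat the bird sat".split()
vocab = set(train_tokens) | set(heldout)

model = DiscountedBigram(train_tokens, vocab, discount=0.5)

# Pick the discount that minimizes held-out perplexity.
candidates = [0.1, 0.3, 0.5, 0.7, 0.9]
best_discount = min(
    candidates,
    key=lambda d: DiscountedBigram(train_tokens, vocab, d).perplexity(heldout),
)
print(best_discount, model.perplexity(heldout))
```

Under these assumptions, the probability removed from seen bigrams equals the probability handed to unseen ones, so the conditional distribution for each context still sums to one over the vocabulary.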

HUANG Yong-wen, HE Zhong-Shi. Smoothing Technique for Statistical Language Model Based on Global Discount[J]. Storage & Process, 2005(8): 51-55.