您的位置: 专家智库 > >

国家自然科学基金(2010GZS0072)

作品数:2 被引量:4H指数:1
发文基金:国家自然科学基金更多>>
相关领域:自动化与计算机技术电子电信更多>>

文献类型

  • 2篇中文期刊文章

领域

  • 2篇自动化与计算...
  • 1篇电子电信

主题

  • 1篇SPEECH...
  • 1篇VARI
  • 1篇BASED_...
  • 1篇IMPROV...
  • 1篇LANGUA...
  • 1篇GRAM

传媒

  • 2篇Journa...

年份

  • 2篇2012
2 条 记 录,以下是 1-2
排序方式:
Vari-gram language model based on word clustering
2012年
Category-based statistic language model is an important method to solve the problem of sparse data.But there are two bottlenecks:1) The problem of word clustering.It is hard to find a suitable clustering method with good performance and less computation.2) Class-based method always loses the prediction ability to adapt the text in different domains.In order to solve above problems,a definition of word similarity by utilizing mutual information was presented.Based on word similarity,the definition of word set similarity was given.Experiments show that word clustering algorithm based on similarity is better than conventional greedy clustering method in speed and performance,and the perplexity is reduced from 283 to 218.At the same time,an absolute weighted difference method was presented and was used to construct vari-gram language model which has good prediction ability.The perplexity of vari-gram model is reduced from 234.65 to 219.14 on Chinese corpora,and is reduced from 195.56 to 184.25 on English corpora compared with category-based model.
袁里驰
Improved hidden Markov model for speech recognition and POS tagging被引量:4
2012年
In order to overcome defects of the classical hidden Markov model (HMM), Markov family model (MFM), a new statistical model was proposed. Markov family model was applied to speech recognition and natural language processing. The speaker independently continuous speech recognition experiments and the part-of-speech tagging experiments show that Markov family model has higher performance than hidden Markov model. The precision is enhanced from 94.642% to 96.214% in the part-of-speech tagging experiments, and the work rate is reduced by 11.9% in the speech recognition experiments with respect to HMM baseline system.
袁里驰
共1页<1>
聚类工具0