基于隐藏单元条件随机场的多知识源融合改进自动语音识别置信度

doi:10.3724/SP.J.1146.2013.01614

Abstract
Figure/Table
References
Related Citation (15)

Download: PDF (269 KB)
Export: BibTeX | EndNote (RIS)

Abstract As to the difficulty of confidence measure estimation regarding to Automatic Speech Recognition (ASR), a strategy resorting to multi-source knowledge combination to improve the confidence measure is proposed in this paper. More specially, the knowledge come from acoustic level, linguistic level and semantic level are firstly selected and then combined by different ways by held-out validation. And then, these multi-source knowledge are integrated under the framework of Hidden-units Conditional Random Fields (HuCRFs). Lastly, the conditional probability computed from HuCRFs is used to be a new estimation procedure of confidence measure for recognition candidate. Experiments show that the discriminative ability of conditional probability of HuCRFs is superior to the conventional posterior computed from lattice. Furthermore, a lattice rescoring is carried out by utilizing the conditional probabilities of HuCRFs to search the best hypotheses and resulted in a significant reduction on Character Error Rate (CER) by about 2% absolutely on a benchmark corpus. Simultaneously, a performance comparison between the performances of long-distance language model based lattice rescoring and conditional probability of HuCRFs based lattice rescoring is also performed and it is further proved that HuCRFs is a better alternative to the estimation of confidence measure in ASR.

Key words： Speech recognition confidence measure Multi-source knowledge combination Hidden-units Conditional Random Fields (HuCRFs) Lattice rescoring

Received: 21 October 2013

PACS:

TP391.42

Corresponding Authors: Gao Xing-long E-mail: gaoxinglong9999@163.com

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	Gao Xing-Long
	Pan Jie-Lin
	Yan Yong-Hong

Cite this article:

Gao Xing-Long,Pan Jie-Lin,Yan Yong-Hong. The Confidence Measure Improvement by Combining Multi-source Knowledge Based on Hidden-units Conditional Random Fields in Automatic Speech Recognition[J]. , 2014, 36(8): 1852-1858.

URL:

http://jeit.ie.ac.cn/EN/10.3724/SP.J.1146.2013.01614 OR http://jeit.ie.ac.cn/EN/Y2014/V36/I8/1852