基于最大似然可变子空间的快速说话人自适应方法

doi:10.3724/SP.J.1146.2011.00839

Abstract
Figure/Table
References
Related Citation (8)

Download: PDF (208 KB)
Export: BibTeX | EndNote (RIS)

Abstract A new rapid speaker adaptation method based on maximum likelihood variable subspace is proposed. A set of bases of the speaker space is obtained by performing Principal Component Analysis (PCA) on the Speaker Dependent (SD) model parameters of the training speakers. Different from conventional subspace based methods, during speaker adaptation, a subset of these bases is dynamically chosen for each speaker using maximum likelihood criteria. The new speaker’s model is constrained in the subspace spanned by those bases. With less free parameters required, the new method can obtain more robust SD model using very little amount of adaptation data. Speech recognition experiments show that the new method can obtain better performance than the eigenvoice method and MLLR method, both in supervised mode and in unsupervised mode.

Key words： Continuous speech recognition Speaker adaptation Eigenvoice Subspace method

Received: 15 August 2011

PACS:

TN912.3

Corresponding Authors: Zhang Wen-lin E-mail: zwlin_2004@163.com

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	Zhang Wen-Lin
	Niu Tong
	Zhang Lian-Hai
	Li Bi-Cheng

Cite this article:

Zhang Wen-Lin,Niu Tong,Zhang Lian-Hai等. Rapid Speaker Adaptation Based on Maximum-likelihood Variable Subspace[J]. , 2012, 34(3): 571-575.

URL:

http://jeit.ie.ac.cn/EN/10.3724/SP.J.1146.2011.00839 OR http://jeit.ie.ac.cn/EN/Y2012/V34/I3/571