|
|
Rapid Speaker Adaptation Based on Maximum-likelihood Variable Subspace |
Zhang Wen-lin Niu Tong Zhang Lian-hai Li Bi-cheng |
Institute of Information Engineering, PLA Information Engineering University, Zhengzhou 450002, China |
|
|
Abstract A new rapid speaker adaptation method based on maximum likelihood variable subspace is proposed. A set of bases of the speaker space is obtained by performing Principal Component Analysis (PCA) on the Speaker Dependent (SD) model parameters of the training speakers. Different from conventional subspace based methods, during speaker adaptation, a subset of these bases is dynamically chosen for each speaker using maximum likelihood criteria. The new speaker’s model is constrained in the subspace spanned by those bases. With less free parameters required, the new method can obtain more robust SD model using very little amount of adaptation data. Speech recognition experiments show that the new method can obtain better performance than the eigenvoice method and MLLR method, both in supervised mode and in unsupervised mode.
|
Received: 15 August 2011
|
|
Corresponding Authors:
Zhang Wen-lin
E-mail: zwlin_2004@163.com
|
|
|
|
|
|
|