文本无关说话人识别中一种改进的模型PCA变换方法

doi:10.3724/SP.J.1146.2005.00749

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (228 KB)
输出: BibTeX | EndNote (RIS) 背景资料

摘要对于采用高斯混合模型(GMM)的与文本无关的说话人识别，出于模型参数数量和计算量的考虑 GMM的协方差矩阵通常取为对角矩阵形式，并假设观察矢量各维之间是不相关的。然而，这种假设在大多情况下是不成立的。为了使观察矢量空间适合于采用对角协方差的GMM进行拟合，通常采用对参数空间或模型空间进行解相关变换。该文提出了一种改进模型空间解相关的PCA方法，通过直接对GMM的各高斯成分的协方差进行主成分分析，使参数空间分布更符合使用对角化协方差的混合高斯分布，并通过共享PCA变换阵的方法减少参数数量和计算量。在微软语音库上的说话人识别实验表明，该方法取得了比常规的对角协方差GMM系统的最优结果有相对35%的误识率下降。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	姚志强
	周曦
	戴蓓蒨

关键词 ：话者识别, PCA, 模型PCA, 解相关

Abstract：There is a basic choice in the form of covariance matrix to be used with Gaussian mixture model in text-independent speaker identification. In general, diagonal covariance matrix is chose, which implies strong assumption that elements of the feature vector are independent, because full covariance matrix suffers from too many parameters and large computational requirement. Unfortunately, in most application the assumption is not reasonable. In order to make feature vectors more suit to be modeled with diagonal covariance, features are usually de-correlated in feature space or model space. In this paper, an improved model-based PCA transformation algorithm is presented to de-correlate the elements of feature vectors. In this algorithm, principal component analysis is directly made for covariance of Gaussians. Also, the number of parameter is deduced through tying the PCA transformation between Gaussians. Experiments on the MSRA mandarin task show that the algorithm could achieve above 35% identification error reduction over the best diagonal covariance models.

Key words： Speaker Identification Principle Component Analysis(PCA) Model-based PCA De-correlation

收稿日期: 2005-06-27

PACS:

TP391.42

基金资助:

国家自然科学基金(60272039)资助课题

引用本文:

姚志强; 周曦; 戴蓓蒨. 文本无关说话人识别中一种改进的模型PCA变换方法[J]. 电子与信息学报, 2007, 29(2): 469-472 . Yao Zhi-qiang; Zhou Xi; Dai Bei-qian . Improved Model-Based PCA Transformation for GMM in Speaker Identification. , 2007, 29(2): 469-472 .

链接本文:

http://jeit.ie.ac.cn/CN/10.3724/SP.J.1146.2005.00749 或 http://jeit.ie.ac.cn/CN/Y2007/V29/I2/469