An Algorithm for Voice Conversion Based on Mixtures of Linear Transformation

doi:10.3724/SP.J.1146.2006.00787

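Abstract: For the case where no parallel speech corpus is available, this paper proposes a voice conversion algorithm based on mixtures of linear transformations. Under the maximum likelihood criterion, the parameters of the transformation function are estimated with the iterative EM algorithm. To reduce the smoothing that linear weighting imposes on the spectral envelope, the chirp Z-transform is used to adjust the LPC coefficients of the speech signal. Both objective and subjective evaluations show that the proposed algorithm achieves conversion quality comparable to that of conventional voice conversion techniques, while removing their requirement for a parallel corpus.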
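The conversion step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the form of the transformation function, so the code below assumes the common posterior-weighted form, y = sum_i p(i|x) (A_i x + b_i), with diagonal-covariance Gaussian components. The function names `gaussian_posteriors` and `convert` and all parameter names are hypothetical, and the EM estimation of the parameters is not shown.

```python
import numpy as np

def gaussian_posteriors(x, weights, means, variances):
    """Posterior p(i | x) for each diagonal-covariance Gaussian component.

    x: (D,) feature vector; weights: (M,); means, variances: (M, D).
    """
    diff = x - means
    # Log-density of x under each component (diagonal covariance assumed).
    log_pdf = -0.5 * np.sum(diff**2 / variances + np.log(2 * np.pi * variances), axis=1)
    log_post = np.log(weights) + log_pdf
    log_post -= np.max(log_post)  # subtract the maximum for numerical stability
    post = np.exp(log_post)
    return post / post.sum()

def convert(x, weights, means, variances, A, b):
    """Posterior-weighted mixture of linear transforms (assumed form).

    A: (M, D, D) per-component matrices; b: (M, D) per-component offsets.
    """
    post = gaussian_posteriors(x, weights, means, variances)  # (M,)
    per_component = np.einsum("mij,j->mi", A, x) + b          # A_i x + b_i, shape (M, D)
    return post @ per_component                               # (D,)
```

Because the output is a weighted average over components, the converted spectral envelope tends to be smoother than any single component's output, which is the smoothing effect the abstract seeks to counteract.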

Keywords: voice conversion, mixtures of linear transformation, expectation-maximization (EM) algorithm, chirp Z-transform

Abstract: This paper proposes a voice conversion algorithm based on mixtures of linear transformations that avoids the need for a parallel training corpus inherent in conventional approaches. Within a maximum likelihood framework, the EM algorithm is used to estimate the parameters of the transformation function. The chirp Z-transform is used to sharpen the spectral envelope, which is smoothed by the linear weighted averaging. The proposed voice conversion system is evaluated with both objective and subjective measures. The experimental results show that the approach effectively transforms speaker identity and achieves results comparable to those of conventional methods that require a parallel corpus.
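To show why a chirp Z-transform can counteract envelope smoothing, the following sketch evaluates an LPC polynomial A(z) on a circular contour of radius r instead of on the unit circle. The abstract does not state the contour or its parameters, so the radius, the function names, and the direct (non-fast) evaluation are all illustrative assumptions rather than the authors' procedure. When the contour lies closer to the poles of 1/A(z), the formant peaks of the envelope become sharper.

```python
import numpy as np

def lpc_polynomial_on_contour(a, radius=1.0, num_points=512):
    """Evaluate A(z) = sum_n a[n] z^(-n) at z_k = radius * exp(2j*pi*k/num_points).

    This is the z-transform on a circular contour, computed directly
    rather than with a fast chirp Z-transform algorithm.
    """
    a = np.asarray(a, dtype=float)
    n = np.arange(len(a))
    k = np.arange(num_points)
    z = radius * np.exp(2j * np.pi * k / num_points)
    return (z[:, None] ** (-n[None, :])) @ a

def envelope(a, radius=1.0, num_points=512):
    """Magnitude of the all-pole envelope 1/|A(z)| on the chosen contour."""
    return 1.0 / np.abs(lpc_polynomial_on_contour(a, radius, num_points))

# Hypothetical second-order example: one pole pair at radius 0.9, angle pi/4.
pole = 0.9 * np.exp(1j * np.pi / 4)
a_example = np.real(np.poly([pole, np.conj(pole)]))
smooth = envelope(a_example, radius=1.0)   # evaluated on the unit circle
sharp = envelope(a_example, radius=0.95)   # contour moved toward the poles
```

With radius 1.0 the evaluation reduces to an ordinary DFT of the coefficients; a radius between the pole radius and 1 raises and narrows the resonance peak.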

Key words: Voice conversion; Ms-LT; EM algorithm; Chirp Z-transform

Received: 2006-06-06

PACS:

TN912.3

Funding:

Supported by the Qing Lan Project of Jiangsu Province (QL003YZ)

Cite this article:

Jian Zhi-hua; Yang Zhen. An Algorithm for Voice Conversion Based on Mixtures of Linear Transformation[J]. Journal of Electronics & Information Technology (电子与信息学报), 2007, 29(7): 1700-1702.

Link to this article:

http://jeit.ie.ac.cn/CN/10.3724/SP.J.1146.2006.00787 or http://jeit.ie.ac.cn/CN/Y2007/V29/I7/1700