一种两步判决的说话人分割算法

doi:10.3724/SP.J.1146.2009.01072

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (200 KB)
输出: BibTeX | EndNote (RIS)

摘要为了提高说话人分割(SS)准确率，该文综合考虑了静音信息和性别信息在SS中的作用，提出了一种两步判决的SS算法。在从音频流中分离出语音段的基础上，采用两步判决的方法进行SS。第1步采用基频信息为主、性别模型为辅的策略进行SS，将相邻说话人基频差异大的说话人改变检测出来；第2步采用基于性别的改进T²判决公式进行SS，实现相邻说话人基频差异小的同性别SS，为此，该文提出了一个基于块的潜在说话人改变点检测算法。实验结果表明，本文算法提高了分割准确率，F₁度量值可达85.14%。对于短时长(<2 s)语音段的SS，该算法和传统的贝叶斯信息判决算法相比，漏检率减少了16%。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	杨继臣
	贺前华
	李艳雄
	王伟凝

关键词 ：语音信号处理, 两步判决, 说话人分割, 基频信息, 性别信息

Abstract：To improve the precision of Speaker Segmentation (SS), this paper propose a two-step SS algorithm by making use of silence and gender information. Two-step criterion is used to decide the Speaker Change Point (SCP) within detected speech segmentations. In the first step, pitch difference between different speakers and gender model are used to locate the SCP within neighboring speech segments; In the second step, a gender-based modified T²criterion formula is used to locate SCP among the same gender speakers, and potential speaker change point is detected based on chunk. The experiment results show that the proposed algorithm improved SS precision and F₁ can reach 85.14%. For SS with duration less than 2 s, the algorithm can reduce missed detection rate of about 16%, compared with Bayesian information Criterion.

Key words： Speech signal processing Two-step criterion Speaker Segmentation (SS) Pitch information Gender information

收稿日期: 2009-08-10

PACS:

TN912.3

基金资助:

国家自然科学基金(60972132，60602014)资助课题

通讯作者: 杨继臣 E-mail: nisonyoung@yahoo.cn

引用本文:

杨继臣, 贺前华, 李艳雄, 王伟凝. 一种两步判决的说话人分割算法[J]. 电子与信息学报, 2010, 32(8): 2006-2009. Yang Ji-Chen, He Qian-Hua, Li Yan-Xiong, Wang Wei-Ning. A Two-step Criterion Algorithm of Speaker Segmentation. , 2010, 32(8): 2006-2009.

链接本文:

http://jeit.ie.ac.cn/CN/10.3724/SP.J.1146.2009.01072 或 http://jeit.ie.ac.cn/CN/Y2010/V32/I8/2006