Voice Activity Detection in Complex Environment Based on Hilbert-Huang Transform and Order Statistics Filter
Lu Zhi-mao① Jin Hui① Zhang Chun-xiang② Ren Ming-xi①
①(Information and Communication Engineering College, Harbin Engineering University, Harbin 150001, China) ②(School of Software, Harbin University of Science and Technology, Harbin 150080, China)
Abstract:Hilbert-Huang Transform (HHT) is a fully data driven adaptive non-stationary signal time-frequency analysis method. But the Hilbert energy spectrum curve of speech signal is fluctuate in strong noise environment,it has a great influence to voice activity detection. So an effective voice activity detection algorithm is proposed based on HHT and Order Statistics Filter (OSF) in this paper. This method first decompose noise signal into intrinsic mode functions by empirical mode decomposition. Then the Hilbert energy spectrum is synthesized by adaptive weight selection of each intrinsic mode functions, through OSF to smooth the energy spectrum. Finally, the speech and noise divergence is judged by means of the smoothed energy spectrum. Experimental results show obviously that under complex noisy environment, this method is still able to effectively detect the speech signal, and reduce the error detection rate in low signal to noise ratio conditions.
卢志茂, 金辉, 张春祥, 任明溪. 基于HHT和OSF的复杂环境语音端点检测[J]. 电子与信息学报, 2012, 34(1): 213-217.
Lu Zhi-Mao, Jin Hui, Zhang Chun-Xiang, Ren Ming-Xi. Voice Activity Detection in Complex Environment Based on Hilbert-Huang Transform and Order Statistics Filter. , 2012, 34(1): 213-217.