In order to enhance the whole quality of single channel speech enhancement denoising algorithm, both noise reducing and speech perception are considered to improve the traditional speech enhancement algorithm and many kinds of processing methods are taken to achieve the best optimization effect. Firstly, in the view of parameters estimation, spectrum smoothing algorithm based on weak speech presence is added to the soft decision method based on fixed prior signal-to-noise ratio in order to solve the problem of noise spectrum overestimation. Moreover, the smoothing parameter is dynamically controlled by the speech presence probability in order to enhance the tracing effect of prior signal-to-noise ratio. Secondly, in the view of the speech perception improvement, the harmonic reconstruction method is used to reconstruct the harmonic components in high frequencies of speech section. Phase compensation method and gain smoothing method are also employed to remove the annoying musical noise in speech and silence segment. The experimental results show that compared with the traditional algorithm, the proposed algorithm obtains good performance in both denoising effect and speech quality by introducing parameter estimation improvement module and perceived quality improvement module, and it is suitable for many kinds of noise environment and signal-to-noise ratio conditions.
MARTIN R. Noise power spectral density estimation based on optimal smoothing and minimum statistics[J]. IEEE Transactions on Speech Audio Processing, 2001, 9(5): 504-512.
[2]
COHEN I. Noise estimation by minima controlled recursive averaging for robust speech enhancement[J]. IEEE Signal Processing Letters, 2002, 9(1): 12-15.
[3]
COHEN I. Noise spectrum estimation in adverse environment: improved minima controlled recursive averaging[J]. IEEE Transactions on Speech Audio Processing, 2003, 11(5): 466-475.
[4]
EPHRAIM Y and MALAH D. Speech enhancement using a minimum mean-square error log-spectral amplitude estimator[J]. IEEE Transactions on Acoustics Speech and Signal Processing, 1985, 33(2): 443-445.
[5]
CYRIL P, CLAUDE M, and PASCAL St. Improved signal-to-noise ratio estimation for speech enhancement[J]. IEEE Transactions on Speech and Language Processing, 2006, 14(6): 2098-2108.
[6]
GERKMANN T and HENDRIKS R C. Noise power estimation based on the probability of speech presence[C]. Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New York, 2011: 145-148.
[7]
GERKMANN T, BREITHAUPT C, and MARTIN R. Improved a posteriori speech presence probability estimation based on a likelihood ratio with fixed priors[J]. IEEE Transactions on Audio Speech and Language Processing, 2008, 16(5): 910-919.
[8]
FENG Y and AN B. Noise power spectrum estimation based on weak speech protection for speech enhancement[C]. Proceedings of 12th International Conference on Signal Processing (ICSP), Hangzhou, 2014: 484-487.
[9]
袁文浩. 基于噪声估计的语音增强方法研究[D]. [硕士论文], 华东理工大学, 2013.
YUAN Wenhao. The study of speech algorithms based on noise power spectrum estimation[D]. [Master dissertation], East China University of Science and Technology, 2013.
[10]
PLAPOUS C, MARRO C, and SCALART P. Speech enhancement using harmonic regeneration[C]. Proceedings of International Conference on Acoustics Speech and Signal Processing, Pennsylvania, 2005: 157-160.
[11]
颜丽君. 基于噪声估计和掩蔽效应的语音增强[D]. [硕士论文], 西南交通大学, 2014.
YAN Lijun. Speech enhancement based on noise estimation and masking effect[D]. [Master dissertation], Southwest Jiaotong University, 2014.
[12]
ESCH T and VARY P. Efficient musical noise suppression for speech enhancement systems[C]. Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP), Taipei, 2009: 4409-4412.
[13]
WOJCICKI K, MILACIC M, STARK A, et al. Exploiting conjugate symmetry of the short-time Fourier spectrum for speech enhancement[J]. IEEE Signal Processing Letters, 2008, 15: 461-464.
[14]
ISLAM Md T and SHAHNAZ C. Speech enhancement based on noise-compensated phase spectral[C]. Proceedings of International Conference on Electronic Engineering and Information & Communication Technology (ICEEICT), Yichang, 2014: 1-5.
[15]
PALIWAL K, W?JCICKI K, and SHANNON B. The importance of phase in speech enhancement[J]. Speech Communications, 2011, 53(4): 465-494.
BU Fangliang, WANG Weimin, and DAI Qijun. Optimizing speech enhancement based on noise masked probability[J]. Journal of Electronics & Information Technology, 2005, 27(5): 753-756.
[17]
ALAYA S, ZOGHLAMI N, and LACHIRI Z. Speech enhancement based on perceptual filter bank improvement[J]. International Journal of Speech Technology, 2014, 17(3): 253-258.
[18]
HU Y and LOIZOU P. Evaluation of objective measures for speech enhancement[C]. Proceeding of Interspeech, Pittsburgh, 2006: 1447-1450.
[19]
ZHANG Jie, ZHAO Xiaoqun, and XU Jingyun. Suitability of speech quality evaluation measures in speech enhancement[C]. 2014 International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, 2014: 22-26.