一种稳健的基于Visemic LDA的口形动态特征及听视觉语音识别

Abstract
Figure/Table
References
Related Citation (15)

Download: PDF (1526 KB)
Export: BibTeX | EndNote (RIS)

Abstract This paper presents a robust visual feature based on Visemic LDA for audio visual speech recognition, which captures dynamic lip contour information and reflects the viseme classes of visual speech. The paper also introduces an automatic labeling method using the speech recognition results for LDA training data, which avoids the tedious manually labeling work and labeling errors. Experimental results show that the audio visual speech recognition system based on the visual features presented in this paper can greatly increase the speech recognition rate in noisy conditions. The combination of the visual feature with multi-stream HMM can bring the recognition rate of over 80% at a 10dB SNR noisy condition.

Key words： Speech recognition Audio visual speech recognition ASM Linear Discriminant Analysis （LDA） Viseme

Received: 11 July 2003

PACS:

TP391.42

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors







	Xie Lei
	Fu Zhong-hua
	Jiang Dong-mei
	Zhao Rong-chun
	Werner Verhelst
	Hichem Sahli
	Jan Conlenis

Cite this article:

Xie Lei,Fu Zhong-hua,Jiang Dong-mei等. A Robust Dynamic Mouth Feature Based on Visemic LDA for Audio Visual Speech Recognition[J]. , 2005, 27(1): 64-68 .

URL:

http://jeit.ie.ac.cn/EN/ OR http://jeit.ie.ac.cn/EN/Y2005/V27/I1/64