Unsupervised Feature Learning with Sparse Autoencoders in YUV Space
LI Zuhe①② FAN Yangyu① WANG Fengqin②
①(School of Electronics and Information, Northwestern Polytechnical University, Xi’an 710072, China) ②(School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China)
Existing unsupervised feature learning algorithms usually extract features in the RGB color space, whereas image and video compression standards widely adopt the YUV color space. To exploit human visual characteristics and avoid the computational cost of color space conversion, an unsupervised feature learning approach based on sparse autoencoders in YUV space is presented. Image patches in YUV space are first randomly sampled and whitened, and then fed into sparse autoencoders to learn local features in an unsupervised way. Since the luminance channel is independent of the chrominance channels in YUV space, a whitening method that treats luminance and chrominance separately is proposed for the pre-processing step. Finally, the features learned over local image patches are convolved with large images to obtain global feature activations, which are then sent to image classification systems for performance testing. Experimental results reveal that unsupervised feature learning in YUV space achieves similar or even slightly better color image classification performance than learning in RGB space, as long as the luminance component is whitened properly.
LI Zuhe, FAN Yangyu, WANG Fengqin. Unsupervised feature learning with sparse autoencoders in YUV space[J]. Journal of Electronics & Information Technology, 2016, 38(1): 29-37.
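The channel-separate whitening described in the abstract can be sketched as follows. This is an illustrative reconstruction rather than the authors' code: the ZCA form of whitening, the patch layout (the three channels concatenated as [Y | U | V] per row), and the regularization constant `eps` are all assumptions.

```python
# Sketch: ZCA whitening applied separately to the luminance (Y) and
# chrominance (U, V) parts of sampled YUV patches, mirroring the idea
# that luminance and chrominance are treated independently.
import numpy as np

def zca_whiten(X, eps=1e-2):
    """ZCA-whiten the rows of X (n_samples x n_features)."""
    X = X - X.mean(axis=0)                 # zero-mean each feature
    cov = X.T @ X / X.shape[0]             # feature covariance
    U, S, _ = np.linalg.svd(cov)
    W = U @ np.diag(1.0 / np.sqrt(S + eps)) @ U.T  # ZCA transform
    return X @ W

def whiten_yuv_patches(patches):
    """patches: n x (3 * p * p) array with channels ordered [Y | U | V]."""
    n, d = patches.shape
    p2 = d // 3                            # pixels per channel
    y = zca_whiten(patches[:, :p2])        # luminance whitened on its own
    uv = zca_whiten(patches[:, p2:])       # chrominance whitened jointly
    return np.concatenate([y, uv], axis=1)

# tiny demo on random stand-in "patches" (200 patches of 8x8x3 values)
rng = np.random.default_rng(0)
out = whiten_yuv_patches(rng.normal(size=(200, 3 * 64)))
print(out.shape)  # (200, 192)
```

The whitened patches would then be fed to the sparse autoencoder; whitening Y alone lets its decorrelation strength be tuned independently of the chrominance channels.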