Target Tracking Based on Enhanced Flock of Tracker and Deep Learning
Cheng Shuai① Cao Yong-gang①② Sun Jun-xi③ Zhao Li-rong①② Liu Guang-wen① Han Guang-liang②
①(School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun 130022, China) ②(Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130000, China) ③(School of Computer Science and Information Technology, Northeast Normal University, Changchun 130117, China)
To solve the problem that the tracking algorithm often leads to drift and failure based on the appearance model and traditional machine learning, a tracking algorithm is proposed based on the enhanced Flock of Tracker (FoT) and deep learning under the Tracking-Learning-Detection (TLD) framework. The target is predicted and tracked by the FoT, the cascaded predictor is added to improve the precision of the local tracker based on the spatio-temporal context, and the global motion model is evaluated by the speed-up random sample consensus algorithm to improve the accuracy. A deep detector is composed of the stacked denoising autoencoder and Support Vector Machine (SVM), combines with a multi-scale scanning window with global search strategy to detect the possible targets. Each sample is weighted by the weighted P-N learning to improve the precision of the deep detector. Compared with the state-of-the-art trackers, according to the results of experiments on variant challenging image sequences in the complex environment, the proposed algorithm has more accuracy and better robust, especially for the occlusions, the background clutter and so on.
程帅,曹永刚,孙俊喜,赵立荣,刘广文,韩广良. 基于增强群跟踪器和深度学习的目标跟踪[J]. 电子与信息学报, 2015, 37(7): 1646-1653.
Cheng Shuai,Cao Yong-gang, Sun Jun-xi, Zhao Li-rong, Liu Guang-wen, Han Guang-liang. Target Tracking Based on Enhanced Flock of Tracker and Deep Learning. JEIT, 2015, 37(7): 1646-1653.
Wu Y, Lim J, and Yang M H. Online object tracking: A benchmark[C]. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Portland, USA, 2013: 2411-2418.
[2]
Ross D A, Lim J, Lin R S, et al.. Incremental learning for robust visual tracking[J]. International Journal of Computer Vision, 2008, 77(3): 125-141.
[3]
Babenko B, Yang M H, and Belongie S. Robust object tracking with online multiple instance learning[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33(8): 1619-1632.
Chen Dong-cheng, Zhu Ming, Gao Wen, et al.. Real-time object tracking via online weighted multiple instance learning [J]. Optics and Precision Engineerin, 2014, 22(6): 1661-1667.
[5]
He S F, Yang Q X, Rynson L, et al.. Visual Tracking via Locality Sensitive Histograms[C]. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Portland, USA, 2013: 2427-2434.
[6]
Grabner H, Grabner M, and Bischof H. Real-time tracking via online boosting[C]. Proceedings of British Machine Vision Conference, Edinburgh, UK, 2006: 47-56.
[7]
Grabner H, Leistner C, and Bischof H. Semi-supervised on-line boosting for robust tracking[C]. Proceedings of European Conference on Computer Vision, Berlin, Germany, 2008: 234-247.
Yan Jia and Wu Ming-yuan. On-line boosting based target tracking under occlusion[J]. Optics and Precision Engineering, 2012, 20(2): 439-446.
[9]
Kalal Z, Matas J, and Mikolajczyk K. P-N learning: bootstrapping binary classifiers by structural constraints[C]. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, New York, USA, 2010: 49-56.
Zheng Ying, Chen Quan-qi, and Zhang Yu-jin. Deep learning and its new progress in object and behavior recognition[J]. Journal of Image and Graphic, 2014, 19(2): 175-184.
[11]
Tomas V and Jiri M. Robustifying the flock of trackers[C]. Proceedings of Computer Vision Winter Workshop, Graz, Austria, 2011: 91-97.
Zhou Xin, Qian Qiu-meng, Ye Yong-qiang, et al.. Improved TLD visual target tracking algorithm[J]. Journal of Image and Graphic, 2013, 18(9): 1115-1123.
[13]
Kalal Z, Mikolajczyk K, and Matas J. Tracking-learning- detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(7): 1409-1422.
[14]
Zhang K, Zhang L, Liu Q, et al.. Fast visual tracking via dense spatio-temporal context learning[C]. Proceedings of European Conference on Computer Vision, Zurich, Switzerland, 2014: 127-141.
[15]
Botterill T, Mills S, and Green R D. New conditional sampling strategies for speeded-up RANSAC[C]. Proceedings of British Machine Vision Conference, London, UK, 2009: 1-11.
[16]
Vincent P, Larochelle H, Lajoie I, et al.. Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion[J]. Journal of Machine Learning Research, 2010, 11(2): 3371-3408.
[17]
Tang Yi-chuan. Deep learning using linear support vector machines[C]. Proceedings of International Conference on Machine Learning: Challenges in Representational Learning Workshop, Atlanta, USA, 2013: 266-272.
[18]
Hinton G E and Salakhutdinov R R. Reducing the dimensionality of data with neural networks[J]. Science, 2006, 313(5786): 504-507.
[19]
Torralba A, Fergus R, and Freeman W T. 80 million tiny images: a large data set for nonparametric object and scene recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(11): 1958-1970.
Gao Wen, Tang Yang, and Zhu Ming. Study on the cascade classifier in target detection under complex background[J]. Acta Physica Sinica, 2014, 63(9): 094204.
[21]
Collins R T, Zhou X H, and Teh S K. An open source tracking test bed and evaluation web site[C]. Proceedings of IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, Breckenridge, USA, 2005: 17-24.
[22]
Stalder S, Grabner H, and Van G L. Beyond semi-supervised tracking: tracking should be as simple as detection, but not simpler than recognition[C]. Proceedings of IEEE Conference on Computer Vision Workshops, Kyoto, Japan, 2009: 1409-1416.
[23]
Dinh T B, Vo N, and Medion G. Context tracker: exploring supporters and distracters in unconstrained environments[C]. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Providence, USA, 2011: 1177-1184.
[24]
Qian Yu, Thang B D, and Gerard M. Online tracking and reacquisition using co-trained generative and discriminative trackers[C]. Proceedings of European Conference on Computer Vision, Marseille, France, 2008: 678-691.