Journal of Frontiers of Computer Science and Technology ›› 2022, Vol. 16 ›› Issue (7): 1633-1648.DOI: 10.3778/j.issn.1673-9418.2012028

• Graphics and Image • Previous Articles     Next Articles

Target Tracking System Constructed by ELM-AE and Transfer Representation Learning

YANG Zheng1, DENG Zhaohong1,+(), LUO Xiaoqing2, GU Xin2, WANG Shitong1   

  1. 1. School of A.pngicial Intelligence and Computer Science, Jiangnan University, Wuxi, Jiangsu 214122, China
    2. Jiangsu North Huguang Opto-Electronics Co., Ltd., Wuxi, Jiangsu 214035, China
  • Received:2020-12-03 Revised:2021-01-28 Online:2022-07-01 Published:2021-02-05
  • Supported by:
    the National Natural Science Foundation of China(61772239);the National Natural Science Foundation of China(61772237)

利用ELM-AE和迁移表征学习构建的目标跟踪系统

杨政1, 邓赵红1,+(), 罗晓清2, 顾鑫2, 王士同1   

  1. 1.江南大学 人工智能与计算机学院,江苏 无锡 214122
    2.江苏北方湖光光电有限公司,江苏 无锡 214035
  • 作者简介:杨政(1989—),男,江苏连云港人,硕士研究生,主要研究方向为特征迁移及其在目标跟踪中的应用。
    YANG Zheng, born in 1989, M.S. candidate. His research interests include feature transfer and its application in target tracking.
    邓赵红(1981—),男,安徽蒙城人,教授,博士生导师,CCF杰出会员,主要研究方向为不确定性人工智能及其应用。
    DENG Zhaohong, born in 1981, professor, Ph.D. supervisor, distinguished member of CCF. His research interests include uncertainty a.pngicial intelligence and its applications.
    罗晓清(1980—),女,江西南昌人,副教授,主要研究方向为图像融合、模式识别、图像处理等。
    LUO Xiaoqing, born in 1980, associate professor. Her research interests include image fusion, pattern recognition, image processing, etc.
    顾鑫(1979—),男,江苏张家港人,博士,高级工程师,主要研究方向为模式识别、人工智能、图像处理技术研究与应用。
    GU Xin, born in 1979, Ph.D., senior engineer. His research interests include pattern recognition, a.pngicial intelligence, image processing tech-nology and its application.
    王士同(1964—),男,江苏扬州人,教授,博士生导师,主要研究方向为人工智能、模式识别等。
    WANG Shitong, born in 1964, professor, Ph.D. supervisor. His research interests include a.pngicial intelligence, pattern recognition, etc.
  • 基金资助:
    国家自然科学基金面上项目(61772239);国家自然科学基金面上项目(61772237)

Abstract:

In the target tracking algorithm, the feature model’s ability to quickly learn image features and the ability to adapt to changes in target features during tracking has always been one of the main research directions of target tracking algorithms. Especially for discriminative target trackers based on image block learning, these two points have become decisive factors affecting the efficiency and robustness of the tracker. However, the performance of most existing similar algorithms on these two abilities cannot achieve satisfactory results. To solve this problem, an efficient and robust feature model is proposed. The feature model first uses extreme learning machine autoencoder (ELM-AE) to quickly perform random feature mapping on complex image features of the target and background image blocks, and then uses the transfer learning ability of transfer representation learning (TRL) to improve the adaptability of random feature space. The feature model is named transfer representation learning with ELM-AE (TRL-ELM-AE). Compared with original complex image features, this model can provide the classifier with more compact and expressive shared features, so that the classifier can learn and classify more quickly and efficiently. In addition, in the target tracking process, the target and background usually change continuously over time. Although the feature migration capability of TRL can already adapt to this, in order to further improve the robustness of the tracker, a strategy of dynamically updating training samples is adopted. Through a large number of experimental and analysis results on the 11 target tracking challenge scenarios proposed by OTB, it is proven that the proposed target tracker has significant advantages over the existing target tracker.

Key words: extreme learning machine (ELM), extreme learning machine autoencoder (ELM-AE), transfer represen-tation learning (TRL), feature adaptation, Gaussian naive Bayes classifier (GNBC), object tracking

摘要:

在目标跟踪算法中,特征模型对图像特征的快速学习能力和对跟踪过程中目标特征变化的自适应能力一直是目标跟踪算法的主要研究方向之一。特别是对于基于图像块学习的判别式目标跟踪器而言,这两点已然成为影响跟踪器效率和鲁棒性的决定性因素。然而,现有的大多数同类算法在这两个能力上的性能并不能达到令人满意的效果。为了解决这一问题,提出了一种高效且鲁棒的特征模型。该特征模型首先利用基于极限学习机的自编码器(ELM-AE)对目标和背景图像块的复杂图像特征快速地进行随机特征映射,再利用迁移表征学习(TRL)的迁移学习能力提高随机特征空间的自适应性。将该特征模型命名为基于ELM自编码器和迁移表征学习的特征模型(TRL-ELM-AE)。与原复杂图像特征相比,通过该模型可以获得更加紧凑且具有表达能力的共享特征。从而使得分类器可以快速高效地学习和分类。此外,在目标跟踪过程中,目标与背景通常会随着时间不停地变化。虽然TRL的特征迁移能力已经可以很好地适应这一点,但是为了进一步提高跟踪器的鲁棒性,还采用了一种动态更新训练样本的策略。通过对OTB提出的11项目标跟踪挑战场景进行大量实验和分析,证明了所提的目标跟踪器较现有的目标跟踪器具有显著优势。

关键词: 极限学习机(ELM), 极限学习机自编码器(ELM-AE), 迁移表征学习(TRL), 特征自适应, 高斯朴素贝叶斯分类器(GNBC), 目标跟踪

CLC Number: