分类激活图增强的图像分类算法

doi:10.3778/j.issn.1673-9418.1902025

计算机科学与探索 ›› 2020, Vol. 14 ›› Issue (1): 149-158.DOI: 10.3778/j.issn.1673-9418.1902025

分类激活图增强的图像分类算法

杨萌林，张文生

1.中国科学院自动化研究所精密感知与控制研究中心，北京 100190
2.中国科学院大学人工智能学院，北京 100049

出版日期:2020-01-01 发布日期:2020-01-09

Image Classification Algorithm Based on Classification Activation Map Enhancement

YANG Menglin, ZHANG Wensheng

1.Research Center of Precision Sensing and Control, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
2.School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China

Online:2020-01-01 Published:2020-01-09

摘要/Abstract

摘要： 分类激活图（CAM）具有稀疏、不连续、不完整等问题，并且目前大部分研究仅将其用于可视化分析。基于此，首先利用扩张卷积设计了自动加权的多尺度特征学习来弥补分类激活图存在的问题，并将该多尺度特征与分类激活图生成方法结合，设计了多尺度分类激活图生成方法。进一步，将该多尺度的分类激活图嵌入到网络中构成了端到端的结构，实现分类性能增强的目的。以残差网络ResNet为骨干网络，提出了分类增强模型ResNet-CE。在三个公开数据集CIFAR10、CIFAR100和STL10上，对该模型进行了大量的实验。实验表明：ResNet-CE在这三个数据集上的分类性能与参数量相当的ResNet相比有明显的提升，识别的错误率分别降低了0.23%、3.56%和7.96%，并且分类性能优于当前大部分的分类网络。提出的算法能够简单地迁移到已有的分类模型中，提高原有模型的分类性能。同时，该算法保留了对模型判断依据可视化和解释的功能，这在医疗影像中的疾病识别、无人驾驶的场景识别等场景中具有一定的应用价值和意义。

关键词: 图像分类, 分类激活图（CAM）, 多尺度, 可视化, 可解释性

Abstract: Classification activation map (CAM) has problems such as sparseness, discontinuity, incompleteness, etc.,and most of the current research only uses it for visual analysis. Based on this, this paper firstly utilizes the dilated convolution to design an automatic weighted multi-scale feature learning method in order to compensate for the defects of CAM and combines the multi-scale feature with the generation method of CAM to develop a multi-scale CAM generation method. Further, this paper embeds the multi-scale CAM into the network to form an end-to-end structure in order to enhance the classification performance. Taking the ResNet as the backbone, this paper proposes a classification enhancement model, ResNet-CE. Extensively experiments are conducted with ResNet-CE on three publicly available datasets, CIFAR10, CIFAR100 and STL10. Experiments show that the classification performance of ResNet-CE on these three datasets is significantly improved compared with the ResNet with similar parameters quantity. The error rates are reduced by 0.23%, 3.56% and 7.96%, respectively and the classification performance is better than most mainstream classification models. The proposed model can be easily transferred to the off-the-shelf model to improve its classification performance. At the same time, the algorithm retains the function of visualization and interpretation of the judgment of the model, which has certain application value and significance in scenes, such as diseases recognition in medical image and scene recognition in unmanned driving, etc.

Key words: image classification, classification activation map (CAM), multiscale, visualization, interpretability

杨萌林，张文生. 分类激活图增强的图像分类算法[J]. 计算机科学与探索, 2020, 14(1): 149-158.

YANG Menglin, ZHANG Wensheng. Image Classification Algorithm Based on Classification Activation Map Enhancement[J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(1): 149-158.

155

HTML			PDF

最新录用	在线预览	正式出版	最新录用	在线预览	正式出版
0	0	0	0	0	155

来源	本网站	其他网站

次数	141	14
比例	91%	9%

摘要

250

最新录用	在线预览	正式出版

0	0	250

	来源	本网站

	次数	250
	比例	100%

[1]	张梦倩，张莉. 粗-细两阶段卷积神经网络算法[J]. 计算机科学与探索, 2021, 15(8): 1501-1510.
[2]	刘靖祎，史彩娟，涂冬景，刘帅. 零样本图像分类综述[J]. 计算机科学与探索, 2021, 15(5): 812-824.
[3]	郑娅峰，赵亚宁，白雪，傅骞. 教育大数据可视化研究综述[J]. 计算机科学与探索, 2021, 15(3): 403-422.
[4]	刘晓龙，王士同. 面向开放集图像分类的模糊域自适应方法[J]. 计算机科学与探索, 2021, 15(3): 515-523.
[5]	杨章静, 王文博, 黄璞, 张凡龙. 基于潜子空间去噪的子空间学习图像分类方法[J]. 计算机科学与探索, 2021, 15(12): 2374-2389.
[6]	安平，冀中，刘西瑶. 任务感知双原型网络的人物交互少样本识别[J]. 计算机科学与探索, 2021, 15(11): 2184-2192.
[7]	李祥霞，吉晓慧，李彬. 细粒度图像分类的深度学习方法[J]. 计算机科学与探索, 2021, 15(10): 1830-1842.
[8]	马翔，邓赵红，王士同. 多粒度融合的模糊规则系统图像特征学习[J]. 计算机科学与探索, 2021, 15(1): 173-184.
[9]	王晓东，赵一宁，肖海力，王小宁，迟学斌. 线上多节点日志流量异常检测系统的研究[J]. 计算机科学与探索, 2020, 14(11): 1828-1837.
[10]	宗海燕，吴秦，王田辰，张淮. 核协同表示下的多特征融合场景识别[J]. 计算机科学与探索, 2019, 13(6): 1038-1048.
[11]	陈幻杰，王琦琦，杨国威，韩佳林，尹成娟，陈隽，王以忠. 多尺度卷积特征融合的SSD目标检测算法[J]. 计算机科学与探索, 2019, 13(6): 1049-1061.
[12]	唐爽，张灵箫，赵俊峰，谢冰，邹艳珍. 面向多源数据的可扩展主题建模分析框架[J]. 计算机科学与探索, 2019, 13(5): 742-752.
[13]	陈德运，付立军，张学松，于梁，陈海龙，李骜. 多种表示的图像分类方法[J]. 计算机科学与探索, 2019, 13(12): 2138-2148.
[14]	任宇杰，杨剑，刘方涛，张启尧. 基于SSD和MobileNet网络的目标检测方法的研究[J]. 计算机科学与探索, 2019, 13(11): 1881-1893.
[15]	曹雅，邓赵红，王士同. 单调约束的TSK模糊系统模型[J]. 计算机科学与探索, 2018, 12(9): 1487-1495.

分类激活图增强的图像分类算法

Image Classification Algorithm Based on Classification Activation Map Enhancement

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐 0

Metrics