Two Steps Image Annotation Algorithm Based on Deep Network with Dropout

Journal of Frontiers of Computer Science and Technology, 2015, Vol. 9, Issue (12): 1494-1505. DOI: 10.3778/j.issn.1673-9418.1505015

YANG Yang+, ZHANG Wensheng, YANG Xuebing

Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

Online: 2015-12-01 Published: 2015-12-04

Abstract

Abstract: The performance of text-based image retrieval depends heavily on image tags, and deep learning can be used to generate such tags automatically. Combining the predictions of multiple classifiers is an effective way to improve classification accuracy. To improve the generalization performance of the deep learning model, the Dropout algorithm is introduced. Dropout randomly drops units (along with their connections) from the neural network during training, which is equivalent to training many sub-networks at the same time. Because the keywords of an image are diverse, this paper proposes a two-step label fusion algorithm for image annotation. In the first step, the keyword vocabulary is divided into three parts (base keywords, candidate keywords and irrelevant keywords) according to the outputs of several different networks. In the second step, the candidate keywords that are strongly correlated with the base keywords are selected. The base keywords and the selected candidates are then used as the labels of the image. Experiments on three popular data sets show that the proposed multi-classifier fusion framework achieves favorable performance for image annotation.

Key words: image auto-annotation, deep learning, ensemble learning, machine learning
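
The two steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the network outputs are combined or how correlation is measured, so the vote threshold `base_votes`, the Jaccard co-occurrence measure, the cut-off `min_correlation`, all function names and the toy data are assumptions made only for illustration.

```python
from collections import Counter


def split_keywords(predictions, vocabulary, base_votes):
    """Step 1: partition the vocabulary by how many networks predicted each keyword.

    Assumption: a keyword predicted by at least `base_votes` networks is a base
    keyword, one predicted by fewer (but at least one) is a candidate, and one
    predicted by none is irrelevant.
    """
    votes = Counter(word for predicted in predictions for word in set(predicted))
    base = {w for w in vocabulary if votes[w] >= base_votes}
    candidates = {w for w in vocabulary if 0 < votes[w] < base_votes}
    irrelevant = set(vocabulary) - base - candidates
    return base, candidates, irrelevant


def cooccurrence_correlation(training_labels):
    """Assumed correlation measure: Jaccard overlap of the image sets of two keywords."""
    images_with = {}
    for image_id, labels in enumerate(training_labels):
        for word in labels:
            images_with.setdefault(word, set()).add(image_id)

    def correlation(a, b):
        ia, ib = images_with.get(a, set()), images_with.get(b, set())
        union = ia | ib
        return len(ia & ib) / len(union) if union else 0.0

    return correlation


def select_candidates(base, candidates, correlation, min_correlation):
    """Step 2: keep candidates strongly correlated with at least one base keyword."""
    return {
        c for c in candidates
        if any(correlation(c, b) >= min_correlation for b in base)
    }


def annotate(predictions, vocabulary, training_labels, base_votes, min_correlation):
    """Return the base keywords plus the selected candidate keywords."""
    base, candidates, _ = split_keywords(predictions, vocabulary, base_votes)
    correlation = cooccurrence_correlation(training_labels)
    return base | select_candidates(base, candidates, correlation, min_correlation)


# Toy data (hypothetical): keyword sets predicted by three networks for one image.
vocabulary = ["sky", "sea", "beach", "car", "tree"]
training_labels = [{"sky", "sea", "beach"}, {"sea", "beach"}, {"sky", "tree"}, {"car"}]
predictions = [{"sky", "sea"}, {"sky", "sea", "beach"}, {"sky", "sea", "tree"}]

labels = annotate(predictions, vocabulary, training_labels,
                  base_votes=3, min_correlation=0.6)
print(sorted(labels))  # ['beach', 'sea', 'sky']
```

In this toy run "sky" and "sea" receive votes from all three networks and become base keywords, "beach" and "tree" become candidates, and only "beach" co-occurs strongly enough with a base keyword to be kept. A different threshold or correlation measure would change the outcome.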

YANG Yang, ZHANG Wensheng, YANG Xuebing. Two Steps Image Annotation Algorithm Based on Deep Network with Dropout[J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(12): 1494-1505.
