融合多尺度特征的深度哈希图像检索方法

doi:10.3778/j.issn.1673-9418.1709060

计算机科学与探索 ›› 2018, Vol. 12 ›› Issue (12): 1974-1986.DOI: 10.3778/j.issn.1673-9418.1709060

融合多尺度特征的深度哈希图像检索方法

周书仁，谢盈，蔡碧野

1. 长沙理工大学综合交通运输大数据智能处理湖南省重点实验室，长沙 410114

2. 长沙理工大学计算机与通信工程学院，长沙 410114

出版日期:2018-12-01 发布日期:2018-12-07

Deep Hashing Method for Image Retrieval Based on Multi-Scale Features

ZHOU Shuren, XIE Ying, CAI Biye

1. Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation, Changsha University of Science and Technology, Changsha 410114, China
2. School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha 410114, China

Online:2018-12-01 Published:2018-12-07

摘要/Abstract

摘要：

近期，许多学者开始结合深度网络进行图像处理，与传统的基于人工抽取的特征相比，由深度卷积网络提取的特征更为准确，效果更好。因此，提出了一种结合卷积神经网络和哈希算法的深度网络架构，用于大规模图像检索。一方面，为了更好地保持哈希编码之间的语义相似性，引入了多任务学习机制，将图像分类信息和图像间的相似度信息同时用于模型的训练，并且根据信息熵理论，训练过程中使哈希编码尽可能地维持均匀分布以增加信息量；另一方面，提出了一种多尺度融合池化方法（multi-scale fusion pooling，MSFP），融合图像中多种尺度的区域信息，提升了检索性能，同时明显地减少了网络参数。在SVHN、CIFAR-10和NUS-WIDE等数据集上的实验结果表明，提出的算法与现有的基准算法相比，检索效果具有明显改善。

关键词: 图像检索, 卷积神经网络, 池化, 哈希编码

Abstract:

Recently, many scholars have done research on the methods of image processing by combining deep neural networks. Compared with the traditional hand-crafted feature, the feature extracted by deep convolutional networks is more accurate and better. Therefore, this paper proposes a deep network architecture combined with convolutional neural networks and Hashing method for large-scale image retrieval. On the one hand, in order to preserve semantic similarity between binary Hash codes as much as possible, this paper introduces the multi-task learning that the similarity information and the classified information are used to train the network. Moreover, according to the information entropy theory, Hash coding is maintained as evenly distributed as possible to increase the amount of information during the training process. On the other hand, this paper proposes a multi-scale fusion pooling method (MSFP), which can fuse the regional information of various scales, and improve the retrieval performance. Meanwhile, the network parameters are obviously reduced by the pooling strategy. Extensive experiments on three large scale datasets SVHN, CIFAR-10 and NUS-WIDE show the improved performance of this method compared with the state-of-the-arts.

Key words: image retrieval, convolutional neural networks, pooling, Hashing

周书仁，谢盈，蔡碧野. 融合多尺度特征的深度哈希图像检索方法[J]. 计算机科学与探索, 2018, 12(12): 1974-1986.

ZHOU Shuren, XIE Ying, CAI Biye. Deep Hashing Method for Image Retrieval Based on Multi-Scale Features[J]. Journal of Frontiers of Computer Science and Technology, 2018, 12(12): 1974-1986.

[1]	任龙杰，孙颖，丁卫平，鞠恒荣，曹金鑫. 基于单种群蛙跳优化CNN的眼底图像多病变检测[J]. 计算机科学与探索, 2021, 15(9): 1762-1772.
[2]	张梦倩，张莉. 粗-细两阶段卷积神经网络算法[J]. 计算机科学与探索, 2021, 15(8): 1501-1510.
[3]	方钧婷，谭晓阳. 注意力级联网络的金属表面缺陷检测算法[J]. 计算机科学与探索, 2021, 15(7): 1245-1254.
[4]	能文鹏，陆军，赵彩虹. 基于关系归纳偏置的睡眠分期综述[J]. 计算机科学与探索, 2021, 15(6): 1026-1037.
[5]	赵小强，徐慧萍. 分级特征融合的图像语义分割[J]. 计算机科学与探索, 2021, 15(5): 949-957.
[6]	马丹，万良，程琪芩，孙志强. Attention-CNN在恶意代码检测中的应用研究[J]. 计算机科学与探索, 2021, 15(4): 670-681.
[7]	张利，邱存月，张凯鑫，张大波，罗浩. 改进胶囊网络优化分层卷积的亚健康识别算法[J]. 计算机科学与探索, 2021, 15(4): 712-722.
[8]	肖振久，杨晓迪，魏宪，唐晓亮. 改进的轻量型网络在图像识别上的应用[J]. 计算机科学与探索, 2021, 15(4): 743-753.
[9]	谭娅娅，孔广黔. 基于深度学习的视频质量评价研究综述[J]. 计算机科学与探索, 2021, 15(3): 423-437.
[10]	钱慧芳，易剑平，付云虎. 基于深度学习的人体动作识别综述[J]. 计算机科学与探索, 2021, 15(3): 438-455.
[11]	杨晨旭，蔡克参，张红云，苗夺谦. 基于人脸图像的二阶段性别分类算法[J]. 计算机科学与探索, 2021, 15(3): 524-532.
[12]	柴恩惠，马占飞，智敏. Norm-DP模型行人检测优化算法[J]. 计算机科学与探索, 2021, 15(3): 545-552.
[13]	刘慧琳, 冯跃, 徐红, 罗坚义. 深度学习的舌体分割研究综述[J]. 计算机科学与探索, 2021, 15(12): 2276-2291.
[14]	李兴秀, 唐建军, 华晶. 结合CNN与双向LSTM的心律失常分类[J]. 计算机科学与探索, 2021, 15(12): 2353-2361.
[15]	李文涛, 彭力. 多尺度通道注意力融合网络的小目标检测算法[J]. 计算机科学与探索, 2021, 15(12): 2390-2400.

融合多尺度特征的深度哈希图像检索方法

Deep Hashing Method for Image Retrieval Based on Multi-Scale Features

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics