Combining Spatial and Semantic Information to Represent Image

doi:10.3778/j.issn.1673-9418.1305016

Journal of Frontiers of Computer Science and Technology ›› 2013, Vol. 7 ›› Issue (10): 896-904.DOI: 10.3778/j.issn.1673-9418.1305016

Previous Articles Next Articles

Combining Spatial and Semantic Information to Represent Image

ZHAO Yue, YU Jian+, XIE Bojun

School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China

Online:2013-10-01 Published:2013-09-30

结合空间语义信息的图像表示方法

赵悦，于剑+，谢博鋆

北京交通大学计算机与信息技术学院，北京 100044

Abstract

Abstract: Recently, bag-of-words (BoW) model has been approved by researchers due to their good performance. There are mainly two categories of bag-of-words models. One is to add spatial information into the image feature representation and the other is to add semantic information. This paper proposes an image feature representation method which combines the spatial information between feature points with the semantic information, and makes the feature show better performance. This paper extracts similar visual words by computing distribution divergence and forms visual phrase, which can present the meaning of image. Image classification experiments based on this method are conducted on UIUC-Sports8 dataset and Scene-15 dataset, and the results show that the visual phrase method has better classification accuracy compared with the conventional bag-of-words model and other models.

Key words: feature representation, bag-of-words model, visual word, visual phrase

摘要： 近年来词袋（bag-of-words，BoW）模型因为其较高的性能而被人们认可。词袋模型的改进方法主要包括两种，一种是在图像特征表示中加入空间信息，另一种是加入语义信息。研究了结合图像特征点间的空间和语义信息的高性能图像特征表示方法，通过计算图像中视觉词间的分布距离，并提取相似的视觉词组成视觉短语，来更好地表示图像。在UIUC-Sports8图像库和Scene-15图像库上进行图像分类实验，并与传统的词袋模型及其他模型进行比较，结果显示视觉词短语方法获得了更高的分类准确率。

关键词: 特征表示, 词袋模型, 视觉词, 视觉短语

ZHAO Yue, YU Jian, XIE Bojun. Combining Spatial and Semantic Information to Represent Image[J]. Journal of Frontiers of Computer Science and Technology, 2013, 7(10): 896-904.

赵悦，于剑，谢博鋆. 结合空间语义信息的图像表示方法[J]. 计算机科学与探索, 2013, 7(10): 896-904.

[1]	CHEN Junfen, ZHANG Ming, ZHAO Jiacheng, XIE Bojun, LI Yan. Deep Clustering Algorithm Based on Denoising and Self-Attention [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(9): 1717-1727.
[2]	SUN Dongpu, QU Li. Survey on Feature Representation and Similarity Measurement of Time Series [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(2): 195-205.
[3]	BIAN Zekang, WANG Shitong, WANG Yuxiang. Locally Linear KNN Method Based on Specific Feature Representation [J]. Journal of Frontiers of Computer Science and Technology, 2018, 12(1): 134-142.
[4]	ZHANG Yong, WANG Guoshuai. Image Representation Method of Visual Phrases Based on Support Region Detection [J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(5): 629-634.

Combining Spatial and Semantic Information to Represent Image

结合空间语义信息的图像表示方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 4

Recommended Articles

Metrics