Journal of Frontiers of Computer Science and Technology


BMTA: Inpainting of Large-Area Damaged Images in Multiple Scenarios

CAO Yan,  XIN Zihao,  WU Kaijun,  SHAN Hongquan,  GUO Bingsen   

  1. School of Electronic and Information Engineering,  Lanzhou Jiaotong University,  Lanzhou 730070,  China

Abstract: To address the incoherent semantic connections between image pixels and the poor restoration of local texture details in images with large damaged regions, this paper proposes a single-stage image inpainting network named BMTA (Block of Multi Transformer Attention). It repairs large-area damaged images across multiple scenes, so that the restored images perform well both in subjective human perception and on objective evaluation metrics. The generator interleaves dual unidirectional attention modules with the convolution layers to compress the features of the input image, reconstruct them, and reinforce the important feature information. The compressed features are then split along the channel dimension into a local branch and a global branch: segmented striped windows establish global information connections, residual dense blocks extract local detail information in depth, and the features from the two branches are fused. In the decoder, to prevent the loss of local information during decoding and the misinterpretation of contextual information during restoration, a gated linear self-attention module preserves information at multiple levels of the network, yielding results closer to the original image. Finally, a discriminator evaluates the restoration results, pushing the restored images toward better structure and texture. The proposed method outperforms current state-of-the-art image inpainting algorithms on the CelebA, StreetView, and Places2 datasets.
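The abstract names two building blocks without giving their layer definitions, so the following is a minimal PyTorch sketch of how such components are commonly built: a residual dense block standing in for the local-detail branch and a gated linear (softmax-free) self-attention module standing in for the decoder's gated attention. All class names, channel sizes, the gating mechanism, and the normalization choices are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ResidualDenseBlock(nn.Module):
    """Densely connected 3x3 convolutions with a residual skip,
    a common form of the 'residual dense block' used for local detail."""

    def __init__(self, channels: int = 64, growth: int = 32, n_layers: int = 4):
        super().__init__()
        # Each layer sees the input plus all previously produced feature maps.
        self.layers = nn.ModuleList(
            nn.Conv2d(channels + i * growth, growth, kernel_size=3, padding=1)
            for i in range(n_layers)
        )
        # 1x1 fusion conv maps the concatenated features back to `channels`.
        self.fuse = nn.Conv2d(channels + n_layers * growth, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        feats = [x]
        for conv in self.layers:
            feats.append(F.relu(conv(torch.cat(feats, dim=1))))
        return x + self.fuse(torch.cat(feats, dim=1))  # residual connection


class GatedLinearSelfAttention(nn.Module):
    """Linear-complexity self-attention whose output is modulated by a learned
    sigmoid gate, so the decoder can keep or suppress contextual features."""

    def __init__(self, channels: int = 64):
        super().__init__()
        self.to_qkv = nn.Conv2d(channels, channels * 3, kernel_size=1)
        self.gate = nn.Conv2d(channels, channels, kernel_size=1)
        self.proj = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        q, k, v = self.to_qkv(x).chunk(3, dim=1)                 # each (b, c, h, w)
        q = q.flatten(2).transpose(1, 2).softmax(dim=-1)         # (b, hw, c), over channels
        k = k.flatten(2).softmax(dim=-1)                         # (b, c, hw), over positions
        v = v.flatten(2).transpose(1, 2)                         # (b, hw, c)
        context = k @ v                                          # (b, c, c), linear in hw
        out = (q @ context).transpose(1, 2).reshape(b, c, h, w)  # back to feature map
        out = self.proj(out) * torch.sigmoid(self.gate(x))       # sigmoid gate on the output
        return x + out


if __name__ == "__main__":
    feats = torch.randn(1, 64, 32, 32)
    out = GatedLinearSelfAttention(64)(ResidualDenseBlock(64)(feats))
    print(out.shape)  # torch.Size([1, 64, 32, 32])
```

The attention sketch follows the usual "efficient attention" factorization (keys aggregated against values first), which keeps memory linear in the number of pixels; whether BMTA uses this exact factorization or a different linearization is not stated in the abstract.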

Key words: Image inpainting, Attention mechanism, Transformer, Feature extraction
