Journal of Frontiers of Computer Science and Technology ›› 2022, Vol. 16 ›› Issue (7): 1603-1610.DOI: 10.3778/j.issn.1673-9418.2104038
• A.pngicial Intelligence • Previous Articles Next Articles
XIA Hongbin1,2, XIAO Yifei1,+(), LIU Yuan1,2
Received:
2021-04-12
Revised:
2021-07-08
Online:
2022-07-01
Published:
2021-07-20
Supported by:
作者简介:
夏鸿斌(1972—),男,江苏无锡人,博士,副教授,CCF会员,主要研究方向为个性化推荐、自然语言处理、网络优化。 基金资助:
CLC Number:
XIA Hongbin, XIAO Yifei, LIU Yuan. Long Text Generation Adversarial Network Model with Self-Attention Mechanism[J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(7): 1603-1610.
夏鸿斌, 肖奕飞, 刘渊. 融合自注意力机制的长文本生成对抗网络模型[J]. 计算机科学与探索, 2022, 16(7): 1603-1610.
Add to citation manager EndNote|Ris|BibTeX
URL: http://fcst.ceaj.org/EN/10.3778/j.issn.1673-9418.2104038
Length | MLE | SeqGAN | LeakGAN | RelGAN | SALGAN | Real |
---|---|---|---|---|---|---|
20 | 9.038 | 8.736 | 7.038 | 6.680 | 6.241 | 5.750 |
40 | 10.411 | 10.310 | 7.191 | 6.765 | 6.273 | 4.071 |
Table 1 Experimental results of synthetic data
Length | MLE | SeqGAN | LeakGAN | RelGAN | SALGAN | Real |
---|---|---|---|---|---|---|
20 | 9.038 | 8.736 | 7.038 | 6.680 | 6.241 | 5.750 |
40 | 10.411 | 10.310 | 7.191 | 6.765 | 6.273 | 4.071 |
Method | BLEU-2 | BLEU-3 | BLEU-4 | BLEU-5 | P-value |
---|---|---|---|---|---|
MLE | 0.731 | 0.497 | 0.305 | 0.189 | <10-6 |
SeqGAN | 0.812 | 0.617 | 0.418 | 0.267 | <10-6 |
LeakGAN | 0.833 | 0.635 | 0.423 | 0.277 | <10-6 |
SALGAN | 0.863 | 0.721 | 0.524 | 0.354 | <10-6 |
RelGAN | 0.849 | 0.687 | 0.502 | 0.331 | <10-6 |
Table 2 Experimental results of COCO IMAGE CAPTIONS dataset
Method | BLEU-2 | BLEU-3 | BLEU-4 | BLEU-5 | P-value |
---|---|---|---|---|---|
MLE | 0.731 | 0.497 | 0.305 | 0.189 | <10-6 |
SeqGAN | 0.812 | 0.617 | 0.418 | 0.267 | <10-6 |
LeakGAN | 0.833 | 0.635 | 0.423 | 0.277 | <10-6 |
SALGAN | 0.863 | 0.721 | 0.524 | 0.354 | <10-6 |
RelGAN | 0.849 | 0.687 | 0.502 | 0.331 | <10-6 |
Method | BLEU-2 | BLEU-3 | BLEU-4 | BLEU-5 | P-value |
---|---|---|---|---|---|
MLE | 0.768 | 0.473 | 0.240 | 0.126 | <10-6 |
SeqGAN | 0.777 | 0.491 | 0.261 | 0.138 | <10-6 |
LeakGAN | 0.826 | 0.645 | 0.437 | 0.272 | <10-6 |
SALGAN | 0.876 | 0.654 | 0.404 | 0.233 | <10-6 |
RelGAN | 0.868 | 0.686 | 0.478 | 0.301 | <10-6 |
Table 3 Experimental results of EMNLP2017 WMT NEWS dataset
Method | BLEU-2 | BLEU-3 | BLEU-4 | BLEU-5 | P-value |
---|---|---|---|---|---|
MLE | 0.768 | 0.473 | 0.240 | 0.126 | <10-6 |
SeqGAN | 0.777 | 0.491 | 0.261 | 0.138 | <10-6 |
LeakGAN | 0.826 | 0.645 | 0.437 | 0.272 | <10-6 |
SALGAN | 0.876 | 0.654 | 0.404 | 0.233 | <10-6 |
RelGAN | 0.868 | 0.686 | 0.478 | 0.301 | <10-6 |
Method | COCO IMAGE CAPTIONS | EMNLP2017 WMT NEWS |
---|---|---|
SeqGAN | (1) a man wearing a helmet and chairs in the middle of a kitchen holding a jug of milk. (2) a bathroom has a wall and shower. | (1) it takes a way for me and gives everyone think that I balance some more people don’t realize their parents. (2) i truly have been awared that the set of the supreme court failed to comment on the patient. |
LeakGAN | (1) a red bathroom with a toilet and seat. (2) black and white photograph of a plane flying in the middle of a clear blue sky. | (1) i mean it is a birthday, I’ve had a long trip and got some hundred dollars, he said. (2) there’s a lot of people submitted with these issues and think about how to do to take the same experiences and recongnize their lives, the conpany said. |
SALGAN | (1) a bird is sitting on a table. (2) a group of people sitting on the table with meat in front of a storefront. | (1) I don’t think he do not know what I think because there are things that millons of ohio are culture of the united states. (2) and later it might like the idea that it has to be a good person that in a certain time, that’s what we’re at the right thing by again, they should never be here, he said. |
RelGAN | (1) a cat is looking on a man in a bathroom. (2) a man in a kitchen preparing food on a table. | (1) he has always vowed to give him a little more on for him, and I think he is very willing for the division. (2) i never thought I could put over the line but i counldn’t quite lose my job. |
Table 4 Examples of samples generated from real datasets
Method | COCO IMAGE CAPTIONS | EMNLP2017 WMT NEWS |
---|---|---|
SeqGAN | (1) a man wearing a helmet and chairs in the middle of a kitchen holding a jug of milk. (2) a bathroom has a wall and shower. | (1) it takes a way for me and gives everyone think that I balance some more people don’t realize their parents. (2) i truly have been awared that the set of the supreme court failed to comment on the patient. |
LeakGAN | (1) a red bathroom with a toilet and seat. (2) black and white photograph of a plane flying in the middle of a clear blue sky. | (1) i mean it is a birthday, I’ve had a long trip and got some hundred dollars, he said. (2) there’s a lot of people submitted with these issues and think about how to do to take the same experiences and recongnize their lives, the conpany said. |
SALGAN | (1) a bird is sitting on a table. (2) a group of people sitting on the table with meat in front of a storefront. | (1) I don’t think he do not know what I think because there are things that millons of ohio are culture of the united states. (2) and later it might like the idea that it has to be a good person that in a certain time, that’s what we’re at the right thing by again, they should never be here, he said. |
RelGAN | (1) a cat is looking on a man in a bathroom. (2) a man in a kitchen preparing food on a table. | (1) he has always vowed to give him a little more on for him, and I think he is very willing for the division. (2) i never thought I could put over the line but i counldn’t quite lose my job. |
[1] | JONAS G, MICHAEL A, DAVID G, et al. A convolutional encoder model for neural machine translation[J]. arXiv:1611. 02344, 2016. |
[2] | YANG Z, CHEN W, WANG F, et al. Improving neural machine translation with conditional sequence generative adversarial nets[J]. arXiv:1703. 04887, 2017. |
[3] | LI J, MONROE W, SHI T, et al. Adversarial learning for neural dialogue generation[J]. arXiv:1701. 06547, 2017. |
[4] | CHO K, VAN MERRIENBOER B, GULCEHRE C, et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation[J]. arXiv:1406. 1078, 2014. |
[5] | GRAVES A. Generating sequences with recurrent neural networks[J]. arXiv:1308. 0850, 2013. |
[6] |
HOCHREITER S, SCHMIDHUBER J. Long short-term memory[J]. Neural Computation, 1997, 9(8): 1735-1780.
DOI URL |
[7] | VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[J]. arXiv:1706. 03762, 2017. |
[8] | ZHANG X, LECUN Y. Text understanding from scratch[J]. arXiv:1502. 01710, 2015. |
[9] | GOODFELLOW I J, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks[C]// Advances in Neural Information Processing Systems 27, 2014: 2672-2680. |
[10] | YU L, ZHANG W, WANG J, et al. SeqGAN: sequence generative adversarial nets with policy gradient[J]. arXiv:1609. 05473, 2016. |
[11] | GUO J X, LU S D, CAI H, et al. Long text generation via adversarial training with leaked information[C]// Proceedings of the 32nd AAAI Conference on A.pngicial Intelligence, the 30th Innovative Applications of A.pngicial Intelligence, and the 8th AAAI Symposium on Educational Advances in A.pngicial Intelligence, New Orleans, Feb 2-7, 2018. Menlo Park: AAAI, 2018: 5141-5148. |
[12] | DETHLEFS N, CUAYÁHUITL H. Hierarchical reinforcement learning for adaptive text generation[C]// Proceedings of the 6th International Natural Language Generation Conference, Trim, Jul 7-9, 2010. Stroudsburg: ACL, 2010: 1-9. |
[13] | ALEXANDER S V, SIMON O, TOM S, et al. Feudal networks for hierarchical reinforcement learning[J]. arXiv:1703. 01161, 2017. |
[14] | PENG B, LI X, LI L, et al. Composite task-completion dialogue system via hierarchical deep reinforcement learning[J]. arXiv:1704. 03084, 2017. |
[15] | NIE W, NARODYTSKA N, PATEL A. RelGAN: relational generative adversarial networks for text generation[C]// Proceedings of the 7th International Conference on Learning Representations, New Orleans, May 6-9, 2019: 1462. |
[16] | SUTTON R S, MCALLESTER D A, SINGH S P, et al. Policy gradient methods for reinforcement learning with function approximation[C]// Advances in Neural Information Processing Systems 12, Denver, Nov 29-Dec 4, 1999. Cam-bridge: MIT Press, 1999: 1057-1063. |
[17] | THOMAS P S, BRUNSKILL E. Policy gradient methods for reinforcement learning with function approximation and action-dependent baselines[J]. arXiv:1706. 06643, 2017. |
[18] | CHE T, LI Y, ZHANG R, et al. Maximum-likelihood augmented discrete generative adversarial networks[J]. arXiv: 1702. 07983, 2017. |
[19] | PAPINENI K, ROUKOS S, WARD T, et al. Bleu: a method for automatic evaluation of machine translation[C]// Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, Jul 6-12, 2002. Stroudsburg: ACL, 2002: 311-318. |
[20] | NITISH S, HINTON G E, KRIZHEVSKY A, et al. Dropout: a simple way to prevent neural networks from overfitting[J]. Journal of Machine Learning Research, 2014, 15(1): 1929-1958. |
[21] | MIRZA M, OSINDERO S. Conditional generative adversarial nets[J]. arXiv:1411. 1784, 2014. |
[1] | AN Fengping, LI Xiaowei, CAO Xiang. Medical Image Classification Algorithm Based on Weight Initialization-Sliding Window CNN [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(8): 1885-1897. |
[2] | ZENG Fanzhi, XU Luqian, ZHOU Yan, ZHOU Yuexia, LIAO Junwei. Review of Knowledge Tracing Model for Intelligent Education [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(8): 1742-1763. |
[3] | LIU Yi, LI Mengmeng, ZHENG Qibin, QIN Wei, REN Xiaoguang. Survey on Video Object Tracking Algorithms [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(7): 1504-1515. |
[4] | ZHAO Xiaoming, YANG Yijiao, ZHANG Shiqing. Survey of Deep Learning Based Multimodal Emotion Recognition [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(7): 1479-1503. |
[5] | HAN Yi, QIAO Linbo, LI Dongsheng, LIAO Xiangke. Review of Knowledge-Enhanced Pre-trained Language Models [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(7): 1439-1461. |
[6] | LIU Yafen, ZHENG Yifeng, JIANG Lingyi, LI Guohe, ZHANG Wenjie. Survey on Pseudo-Labeling Methods in Deep Semi-supervised Learning [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(6): 1279-1290. |
[7] | SUN Fangwei, LI Chengyang, XIE Yongqiang, LI Zhongbo, YANG Caidong, QI Jin. Review of Deep Learning Applied to Occluded Object Detection [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(6): 1243-1259. |
[8] | SHEN Ruicai, ZHAI Junhai, HOU Yingzhen. Multi-discriminator Generative Adversarial Networks Based on Selective Ensemble Learning [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(6): 1429-1438. |
[9] | LIN Jiawei, WANG Shitong. Deep Adversarial-Reconstruction-Classification Networks for Unsupervised Domain Adaptation [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(5): 1107-1116. |
[10] | CHENG Weiyue, ZHANG Xueqin, LIN Kezheng, LI Ao. Deep Convolutional Neural Network Algorithm Fusing Global and Local Features [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(5): 1146-1154. |
[11] | ZHONG Mengyuan, JIANG Lin. Review of Super-Resolution Image Reconstruction Algorithms [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(5): 972-990. |
[12] | PEI Lishen, ZHAO Xuezhuan. Survey of Collective Activity Recognition Based on Deep Learning [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(4): 775-790. |
[13] | XU Jia, WEI Tingting, YU Ge, HUANG Xinyue, LYU Pin. Review of Question Difficulty Evaluation Approaches [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(4): 734-759. |
[14] | ZHU Weijie, CHEN Ying. Micro-expression Recognition Convolutional Network for Dual-stream Temporal-Domain Information Interaction [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(4): 950-958. |
[15] | JIANG Yi, XU Jiajie, LIU Xu, ZHU Junwu. Research on Edge-Guided Image Repair Algorithm [J]. Journal of Frontiers of Computer Science and Technology, 2022, 16(3): 669-682. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||
/D:/magtech/JO/Jwk3_kxyts/WEB-INF/classes/