Visual Question Answering Model Incorporating Multi-modal Knowledge and Supervised Retrieval
GE Yilin, SUN Haichun, YUAN Deyu
Journal of Frontiers of Computer Science and Technology . 0, (): 1 -17 .  DOI: 10.3778/j.issn.1673-9418.2407055