Visual Question Generation,ACM Computing Surveys

当前位置： X-MOL 学术 › ACM Comput. Surv. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Visual Question Generation
ACM Computing Surveys ( IF 23.8 ) Pub Date : 2020-05-29 , DOI: 10.1145/3383465
Charulata Patil ₁ , Manasi Patwardhan ₁

Affiliation

Visual question generation (VQG) is an interesting problem that has recently received attention. The task of VQG involves generating meaningful questions based on the input image. It is a multi-modal problem involving image understanding and natural language generation, especially using deep learning methods. VQG can be considered as complementary task of visual question answering. In this article, we review the current state of VQG in terms of methods to understand the problem, existing datasets to train the VQG model, evaluation metrics, and algorithms to handle the problem. Finally, we discuss the challenges that need to be conquered and the possible future directions for an effective VQG.

中文翻译：

视觉问题生成

视觉问题生成（VQG）是一个最近受到关注的有趣问题。VQG 的任务涉及根据输入图像生成有意义的问题。这是一个涉及图像理解和自然语言生成的多模态问题，尤其是使用深度学习方法。VQG 可以被认为是视觉问答的补充任务。在本文中，我们从理解问题的方法、训练 VQG 模型的现有数据集、评估指标和处理问题的算法等方面回顾了 VQG 的当前状态。最后，我们讨论了需要克服的挑战以及有效的 VQG 可能的未来方向。

更新日期：2020-05-29

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11