Open-ended long-form video question answering is a challenging problem in visual information retrieval: the task is to automatically generate a natural-language answer to a question from the content of the referenced long-form video. However, existing video question answering work focuses mainly on short-form video, owing to the lack of models for the semantic representation of long-form video content. In this paper, we consider the problem of long-form video question
doi:10.24963/ijcai.2018/512 dblp:conf/ijcai/ZhaoZXYYCWZ18 fatcat:n6yvgkp54rh6hksn2yi6pxdoki