A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment
[article]
2019
arXiv
pre-print
This research strives for natural language moment retrieval in long, untrimmed video streams. The problem is not trivial especially when a video contains multiple moments of interests and the language describes complex temporal dependencies, which often happens in real scenarios. We identify two crucial challenges: semantic misalignment and structural misalignment. However, existing approaches treat different moments separately and do not explicitly model complex moment-wise temporal relations.
arXiv:1812.00087v2
fatcat:cbxtybz4cnf3xbqiudlyj3rlm4