A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments
2020
Findings of the Association for Computational Linguistics: EMNLP 2020
unpublished
For embodied agents, navigation is an important ability but not an isolated goal. Agents are also expected to perform specific tasks after reaching the target location, such as picking up objects and assembling them into a particular arrangement. We combine Vision-and-Language Navigation, assembling of collected objects, and object referring expression comprehension, to create a novel joint navigationand-assembly task, named ARRAMON. During this task, the agent (similar to a PokéMON GO player)
doi:10.18653/v1/2020.findings-emnlp.348
fatcat:w5gt46tu5rbq5gxdexhg3xsmti