A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention and Spatial Memory
2020
International Journal of Computer Vision
The role of robots in society keeps expanding, bringing with it the necessity of interacting and communicating with humans. In order to keep such interaction intuitive, we provide automatic wayfinding based on verbal navigational instructions. Our first contribution is the creation of a large-scale dataset with verbal navigation instructions. To this end, we have developed an interactive visual navigation environment based on Google Street View; we further design an annotation method to
doi:10.1007/s11263-020-01374-3
fatcat:dtnss4zihzbizhcbmhrqnnpkee