A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Connecting Language and Vision to Actions
2018
Proceedings of ACL 2018, Tutorial Abstracts
A long-term goal of AI research is to build intelligent agents that can see the rich visual environment around us, communicate this understanding in natural language to humans and other agents, and act in a physical or embodied environment. To this end, recent advances at the intersection of language and vision have made incredible progress -from being able to generate natural language descriptions of images/videos, to answering questions about them, to even holding freeform conversations about
doi:10.18653/v1/p18-5004
dblp:conf/acl/AndersonDW18
fatcat:ilrvhjobwrhcdf3iftjbeunbmq