Grounding spatial prepositions for video search
2009
Proceedings of the 2009 international conference on Multimodal interfaces - ICMI-MLMI '09
Spatial language video retrieval is an important real-world problem that forms a test bed for evaluating semantic structures for natural language descriptions of motion on naturalistic data. Video search by natural language query requires that linguistic input be converted into structures that operate on video in order to find clips that match a query. This paper describes a framework for grounding the meaning of spatial prepositions in video. We present a library of features that can be used
doi:10.1145/1647314.1647369
dblp:conf/icmi/TellexR09
fatcat:wydadfpqbvcqto7zyiaprfs72q