VOILA: An Optimised Dialogue System for Interactively Learning Visually-Grounded Word Meanings (Demonstration System)

Yanchao Yu, Arash Eshghi, Oliver Lemon
2017 Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue  
We present VOILA: an optimised, multimodal dialogue agent for interactive learning of visually grounded word meanings from a human user. VOILA is: (1) able to learn new visual categories interactively from users from scratch; (2) trained on real human-human dialogues in the same domain, and so is able to conduct natural spontaneous dialogue; (3) optimised to find the most effective trade-off between the accuracy of the visual categories it learns and the cost it incurs to users. VOILA is
... d on Furhat, a humanlike, multimodal robot head with back-projection of the face, and a graphical virtual character.
doi:10.18653/v1/w17-5524 dblp:conf/sigdial/YuEL17 fatcat:e3euu6jywnajxcikviemlq442y