A Visually-grounded First-person Dialogue Dataset with Verbal and Non-verbal Responses

Hisashi Kamezawa
2021 Journal of Natural Language Processing  
doi:10.5715/jnlp.28.259 fatcat:vubbikfmqbea5hf62nqyfjlcqm