A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Visual Linguistic Model and Its Applications in Image Captioning
2020
SN Computer Science
Image captioning is a well-known task of generating textual description of a given image. Research work on this problem statement requires efforts in both computer vision and natural language processing domains to obtain better quality image descriptions. In this paper, we are proposing a new deep learning approach to generate image captions. In this approach, we generate a sequence of visual embeddings for objects and their relationships present in the image. These visual embeddings are
doi:10.1007/s42979-020-00135-w
fatcat:kzq7ekgmtfdohhp4x5jl4k3y3i