Audio Assistant Based Image Captioning System Using RLSTM and CNN

D. Akash Reddy, T. Venkat Raju, V. Shashank
2022 International Journal for Research in Applied Science and Engineering Technology  
Abstract-- Visually impaired and partially sighted people face many difficulties in reading text and in identifying what is in the scene around them. To address this, we developed an audio-based image captioner that identifies the objects in an image, forms a meaningful sentence describing them, and delivers that sentence as audio output. Image processing is a widely used method for developing new applications. It is also open source, so developers can use it easily. We used natural language processing (NLP) to understand the description of an image and to convert the text to speech. The system combines a CNN with an R-LSTM, a reference-based long short-term memory network that matches different text data, takes it as a reference, and produces the output. Other applications of image captioning include social media platforms such as Instagram, virtual assistants, and video editing software.
doi:10.22214/ijraset.2022.44289 fatcat:xg6oqawmezfe3aikws5iknt3lu