News Updates Monday 25th Nov 2024 :
  • Welcome to INPRESSCO, world's leading publishers, We have served more than 10000+ authors
  • Articles are invited in engineering, science, technology, management, industrial engg, biotechnology etc.
  • Paper submission is open. Submit online or at editor.ijcet@inpressco.com
  • Our journals are indexed in NAAS, University of Regensburg Germany, Google Scholar, Cross Ref etc.
  • DOI is given to all articles

Image Captioning Model using Visual Aligning Attention and Deep Matrix Factorization


Author : Shweta Dhurmekar and Prof. Deipali Gore

Pages : 948-951
Download PDF
Abstract

Image captioning technique is a complicated task that bridges both the visual and linguistic domains. Image captioning models are required to understand the content of input images to generate sentences with human languages. The attention technique, widely used for Image Captioning task provides more accurate information. Attention technique explicitly trains the deep sequential models. In this work, we have proposed a system using visual aligning attention model and deep matrix factorization; Visual aligning attention model focuses on the region of interest using CNN and LSTM as encoder- decoder. While DMF works on refinement and assignment of image tag. The dataset used is FLICKR8k for caption generation. The experimental results show that the proposed system gives more accurate results. Captions generated are more descriptive and accurate.

Keywords: Encoder-decoder; Visual Aligning; Global Aligning; CNN; RNN; Semantic; Remote Sensing; LSTM; Language Model.

Call for Papers
  1. IJCET- Current Issue
  2. Issues are published in Feb, April, June, Aug, Oct and Dec
  3. DOI is given to all articles
  • Inpressco Google Scholar
  • Inpressco Science Central
  • Inpressco Global impact factor
  • Inpressco aap

International Press corporation is licensed under a Creative Commons Attribution-Non Commercial NoDerivs 3.0 Unported License
©2010-2023 INPRESSCO® All Rights Reserved