• Mi Re-Unir
    Búsqueda Avanzada
    JavaScript is disabled for your browser. Some features of this site may not work without it.
    Ver ítem 
    •   Inicio
    • UNIR REVISTAS
    • Revista IJIMAI
    • 2019
    • vol. 5, nº 5, june 2019
    • Ver ítem
    •   Inicio
    • UNIR REVISTAS
    • Revista IJIMAI
    • 2019
    • vol. 5, nº 5, june 2019
    • Ver ítem

    A Novel Approach on Visual Question Answering by Parameter Prediction using Faster Region Based Convolutional Neural Network

    Autor: 
    Jha, Sudan
    ;
    Dey, Anirban
    ;
    Kumar, Raghvendra
    ;
    Kumar-Solanki, Vijender
    Fecha: 
    06/2019
    Palabra clave: 
    computer vision; neural network; natural language processing; stochastic gradient descent.; long short term memory; IJIMAI
    Tipo de Ítem: 
    article
    URI: 
    https://reunir.unir.net/handle/123456789/12505
    DOI: 
    http://doi.org/10.9781/ijimai.2018.08.004
    Dirección web: 
    https://www.ijimai.org/journal/bibcite/reference/2688
    Open Access
    Resumen:
    Visual Question Answering (VQA) is a stimulating process in the field of Natural Language Processing (NLP) and Computer Vision (CV). In this process machine can find an answer to a natural language question which is related to an image. Question can be open-ended or multiple choice. Datasets of VQA contain mainly three components; questions, images and answers. Researchers overcome the VQA problem with deep learning based architecture that jointly combines both of two networks i.e. Convolution Neural Network (CNN) for visual (image) representation and Recurrent Neural Network (RNN) with Long Short Time Memory (LSTM) for textual (question) representation and trained the combined network end to end to generate the answer. Those models are able to answer the common and simple questions that are directly related to the image’s content. But different types of questions need different level of understanding to produce correct answers. To solve this problem, we use faster Region based-CNN (R-CNN) for extracting image features with an extra fully connected layer whose weights are dynamically obtained by LSTMs cell according to the question. We claim in this paper that a single R-CNN architecture can solve the problems related to VQA by modifying weights in the parameter prediction layer. Authors trained the network end to end by Stochastic Gradient Descent (SGD) using pretrained faster R-CNN and LSTM and tested it on benchmark datasets of VQA.
    Mostrar el registro completo del ítem
    Ficheros en el ítem
    icon
    Nombre: ijimai_5_5_4_pdf_36854.pdf
    Tamaño: 1.569Mb
    Formato: application/pdf
    Ver/Abrir
    Este ítem aparece en la(s) siguiente(s) colección(es)
    • vol. 5, nº 5, june 2019

    Estadísticas de uso

    Año
    2012
    2013
    2014
    2015
    2016
    2017
    2018
    2019
    2020
    2021
    2022
    Vistas
    0
    0
    0
    0
    0
    0
    0
    0
    0
    0
    17
    Descargas
    0
    0
    0
    0
    0
    0
    0
    0
    0
    0
    49

    Ítems relacionados

    Mostrando ítems relacionados por Título, autor o materia.

    • Comparative study on ant colony optimization (ACO) and K-Means clustering approaches for jobs scheduling and energy optimization model in Internet of Things (IoT) 

      Kumar, Sumit; Kumar-Solanki, Vijender; Kumar Choudhary, Saket; Selamat, Ali; González-Crespo, Rubén (1) (International Journal of Interactive Multimedia and Artificial Intelligence, 03/2020)
      The concept of Internet of Things (IoT) was proposed by Professor Kevin Ashton of the Massachusetts Institute of Technology (MIT) in 1999. IoT is an environment that people understand in many different ways depending on ...
    • Comparative Study on Ant Colony Optimization (ACO) and K-Means Clustering Approaches for Jobs Scheduling and Energy Optimization Model in Internet of Things (IoT) 

      Kumar, Sumit; Kumar-Solanki, Vijender; Kumar Choudhary, Saket; Selamat, Ali; González-Crespo, Rubén (International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI), 03/2020)
      The concept of Internet of Things (IoT) was proposed by Professor Kevin Ashton of the Massachusetts Institute of Technology (MIT) in 1999. IoT is an environment that people understand in many different ways depending on ...
    • Spiking Activity of a LIF Neuron in Distributed Delay Framework 

      Kumar Choudhary, Saket; Singh, Karan; Kumar Solanki, Vijender (International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI), 2016)
      Evolution of membrane potential and spiking activity for a single leaky integrate-and-fire (LIF) neuron in distributed delay framework (DDF) is investigated. DDF provides a mechanism to incorporate memory element in ...

    Mi cuenta

    AccederRegistrar

    ¿necesitas ayuda?

    Manual de UsuarioAutorización TFG-M

    Listar

    todo Re-UnirComunidades y coleccionesPor fecha de publicaciónAutoresTítulosPalabras claveTipo documentoTipo de accesoEsta colecciónPor fecha de publicaciónAutoresTítulosPalabras claveTipo documentoTipo de acceso






    Aviso Legal Política de Privacidad Política de Cookies Cláusulas legales RGPD
    © UNIR - Universidad Internacional de La Rioja
     
    Aviso Legal Política de Privacidad Política de Cookies Cláusulas legales RGPD
    © UNIR - Universidad Internacional de La Rioja