Recent encouraging advances in computer vision and natural language understanding shed light on a very interesting yet challenging task: asking and answering questions about a given image (VQA). The study of this research problem is still in its infancy. Most existing VQA methods are neural network-based (NN-based) solutions that pursue...
A core problem in many computer vision applications is visual recognition (including object classification, detection and localization). Recent advances in artificial neural networks (aka ”deep learning”) have significantly pushed forward the state-of-the-art visual recognition performances. However, due to the lack of semantic structure modeling, most current deep learning approaches do...