Visual Question Answering
Yash Vadi
Uday Karan Kapur
21st Feb 2024
Overview
What is VQA?
Datasets
etc.
Methods
Revisiting Visual Question Answering Baselines by Jabri et al.
Ref: Revisiting Visual Question Answering Baselines (Nov-2016)
Results
Ablations and Error Analysis
Deep Modular Co-Attention Networks for VQA by Yu et al. (2019)
How many puppies are in the image?
How many puppies can you see in the image?
Co-Attention
Self Attention and Guided Attention units
Credits: Yu et al. (2019)
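The two units can be illustrated with a minimal sketch built on scaled dot-product attention. This is a simplification under stated assumptions, not the authors' implementation: it uses a single head and omits the learned projections, feed-forward layer, residual connections, and layer normalization that the full units include. The function names and toy feature values are hypothetical.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention: one output vector per query."""
    d = len(keys[0])
    out = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in keys]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, values))
                    for j in range(len(values[0]))])
    return out

def self_attention(x):
    """SA sketch: queries, keys, and values all come from x."""
    return attention(x, x, x)

def guided_attention(x, y):
    """GA sketch: x supplies the queries, y supplies keys and values."""
    return attention(x, y, y)

# Toy features (assumed values, for illustration only).
question = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # 3 word features
image = [[0.5, 0.5], [2.0, 0.0]]                 # 2 region features

sa = self_attention(question)         # question attends to itself
ga = guided_attention(image, question)  # image regions guided by question
```

The output of guided attention has one row per image region, each a weighted mix of question features, which is what lets the question steer the image representation.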
MCA Compositions
Credits: Yu et al. (2019)
MCA Network (MCAN) Architecture
Credits: Yu et al. (2019)
Question and Image Representation
MCA Network (MCAN) Architecture
Credits: Yu et al. (2019)
Deep Co-Attention Learning
Credits: Yu et al. (2019)
MCA Network (MCAN) Architecture
Credits: Yu et al. (2019)
Multimodal Fusion and Output Classifier
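A rough sketch of this stage, assuming the usual recipe of pooling each modality into one vector with attention weights, summing the two vectors, normalizing, and scoring each candidate answer with a sigmoid. It is an illustration under assumptions rather than the paper's code: a single hypothetical scoring vector stands in for the learned pooling network, the per-modality projections are taken as identity, and all weights and feature values are made up.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attentional_reduce(features, score_weights):
    """Pool a set of feature vectors into one vector via attention weights."""
    alpha = softmax([dot(f, score_weights) for f in features])
    return [sum(a * f[j] for a, f in zip(alpha, features))
            for j in range(len(features[0]))]

def layer_norm(vec, eps=1e-6):
    """Normalize a vector to zero mean and (near) unit variance."""
    mean = sum(vec) / len(vec)
    var = sum((v - mean) ** 2 for v in vec) / len(vec)
    return [(v - mean) / math.sqrt(var + eps) for v in vec]

def fuse(image_vec, question_vec):
    """Sum the two pooled vectors, then layer-normalize (identity projections assumed)."""
    return layer_norm([a + b for a, b in zip(image_vec, question_vec)])

def classify(fused, answer_weights):
    """Independent sigmoid score per candidate answer."""
    return [1.0 / (1.0 + math.exp(-dot(fused, w))) for w in answer_weights]

# Toy inputs (assumed values, for illustration only).
image_feats = [[1.0, 0.0, 2.0], [0.0, 1.0, 0.0]]
question_feats = [[0.5, 0.5, 0.0], [1.0, 0.0, 1.0]]
pool_w = [0.1, 0.2, 0.3]                              # hypothetical scoring vector
answer_weights = [[1.0, 0.0, -1.0], [0.0, 1.0, 0.5]]  # 2 hypothetical answers

fused = fuse(attentional_reduce(image_feats, pool_w),
             attentional_reduce(question_feats, pool_w))
scores = classify(fused, answer_weights)
```

Scoring each answer with its own sigmoid, rather than a softmax over all answers, allows several answers to be plausible for the same question.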
Experiments
Experiments
Credits: Yu et al. (2019)
Experiments
Credits: Yu et al. (2019)
Experiments
Credits: Yu et al. (2019)
Experiments
Credits: Yu et al. (2019)
Experiments
Credits: Yu et al. (2019)
Experiments
Credits: Yu et al. (2019)
Thank you!