Abstract: As a typical cross-modal problem, visual question answering (VQA) has received increasing attention from the communities of computer vision and natural language processing. Reading and ...