A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Filters
Medical Visual Question Answering: A Survey
[article]
2022
arXiv
pre-print
Given a medical image and a clinically relevant question in natural language, the medical VQA system is expected to predict a plausible and convincing answer. ...
Our goal is to provide comprehensive information for researchers interested in medical artificial intelligence. ...
VQA-Med-2021 VQA-Med-2021 [13] is published in ImageCLEF 2021 challenge. The VQA-Med-2021 is created under the principles as those in VQA-Med-2020. ...
arXiv:2111.10056v2
fatcat:4dihtqmptbgj5lozrv3lfxqv7q
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
2021
The Journal of Artificial Intelligence Research
., image or video. ...
We extend our special thanks to Matthew Kuhn and Stephanie Lund for painstakingly proofing the whole manuscript. ...
While earlier research addresses only natural images, some approaches also incorporated medical domain knowledge to generate realistic and accurate descriptions for medical images. ...
doi:10.1613/jair.1.11688
fatcat:kvfdrg3bwrh35fns4z67adqp6i
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
[article]
2020
arXiv
pre-print
., image or video. ...
We extend our special thanks to Matthew Kuhn and Stephanie Lund for painstakingly proofing the whole manuscript. ...
For learning local features of objects in the images represented with bounding boxes, the preferred choice is to utilize region specific CNN architectures such as Region-based CNN (R-CNN) (Ren et al., ...
arXiv:1907.09358v2
fatcat:4fyf6kscy5dfbewll3zs7yzsuq