Filters








3 Hits in 4.0 sec

Medical Visual Question Answering: A Survey [article]

Zhihong Lin, Donghao Zhang, Qingyi Tac, Danli Shi, Gholamreza Haffari, Qi Wu, Mingguang He, Zongyuan Ge
2022 arXiv   pre-print
Given a medical image and a clinically relevant question in natural language, the medical VQA system is expected to predict a plausible and convincing answer.  ...  Our goal is to provide comprehensive information for researchers interested in medical artificial intelligence.  ...  VQA-Med-2021 VQA-Med-2021 [13] is published in ImageCLEF 2021 challenge. The VQA-Med-2021 is created under the principles as those in VQA-Med-2020.  ... 
arXiv:2111.10056v2 fatcat:4dihtqmptbgj5lozrv3lfxqv7q

Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

Aditya Mogadala, Marimuthu Kalimuthu, Dietrich Klakow
2021 The Journal of Artificial Intelligence Research  
., image or video.  ...  We extend our special thanks to Matthew Kuhn and Stephanie Lund for painstakingly proofing the whole manuscript.  ...  While earlier research addresses only natural images, some approaches also incorporated medical domain knowledge to generate realistic and accurate descriptions for medical images.  ... 
doi:10.1613/jair.1.11688 fatcat:kvfdrg3bwrh35fns4z67adqp6i

Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods [article]

Aditya Mogadala and Marimuthu Kalimuthu and Dietrich Klakow
2020 arXiv   pre-print
., image or video.  ...  We extend our special thanks to Matthew Kuhn and Stephanie Lund for painstakingly proofing the whole manuscript.  ...  For learning local features of objects in the images represented with bounding boxes, the preferred choice is to utilize region specific CNN architectures such as Region-based CNN (R-CNN) (Ren et al.,  ... 
arXiv:1907.09358v2 fatcat:4fyf6kscy5dfbewll3zs7yzsuq