A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Learning Open Domain Multi-hop Search Using Reinforcement Learning
2022
Proceedings of the Workshop on Structured and Unstructured Knowledge Integration (SUKI)
unpublished
We propose a method to teach an automated agent to learn how to search for multi-hop paths of relations between entities in an open domain. The method learns a policy for directing existing information retrieval and machine reading resources to focus on relevant regions of a corpus. The approach formulates the learning problem as a Markov decision process with a state representation that encodes the dynamics of the search process and a reward structure that minimizes the number of documents
doi:10.18653/v1/2022.suki-1.4
fatcat:6jjynrrjpbhwlgrbthkntaxl7u