Learning How to Active Learn by Dreaming

Thuy-Trang Vu, Ming Liu, Dinh Phung, Gholamreza Haffari
2019 Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics  
Heuristic-based active learning (AL) methods are limited when the data distribution of the underlying learning problems vary. Recent data-driven AL policy learning methods are also restricted to learn from closely related domains. We introduce a new sample-efficient method that learns the AL policy directly on the target domain of interest by using wake and dream cycles. Our approach interleaves between querying the annotation of the selected datapoints to update the underlying student learner
more » ... nd improving AL policy using simulation where the current student learner acts as an imperfect annotator. We evaluate our method on cross-domain and cross-lingual text classification and named entity recognition tasks. Experimental results show that our dream-based AL policy training strategy is more effective than applying the pretrained policy without further fine-tuning, and better than the existing strong baseline methods that use heuristics or reinforcement learning.
doi:10.18653/v1/p19-1401 dblp:conf/acl/VuLPH19 fatcat:ot2afy4xobftppuk4brtd4hj7m