A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf.
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
published
Language models pretrained on text from a wide variety of sources form the foundation of today's NLP. In light of the success of these broad-coverage models, we investigate whether it is still helpful to tailor a pretrained model to the domain of a target task. We present a study across four domains (biomedical and computer science publications, news, and reviews) and eight classification tasks, showing that a second phase of pretraining in-domain (domain-adaptive pretraining) leads to performance gains, under both high- and low-resource settings. Moreover, adapting to the task's unlabeled data (task-adaptive pretraining) improves performance even after domain-adaptive pretraining.
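
The abstract describes domain-adaptive pretraining (DAPT): a second phase of unsupervised (masked language model) pretraining on unlabeled in-domain text before fine-tuning on the target task. The sketch below illustrates that general recipe using Hugging Face Transformers; the library choice, the file name domain_corpus.txt, and all hyperparameters are illustrative assumptions, not the paper's exact setup.

```python
# Minimal sketch of domain-adaptive pretraining (DAPT): continue masked-LM
# pretraining of RoBERTa on unlabeled in-domain text, then fine-tune the
# resulting checkpoint on the downstream classification task as usual.
# "domain_corpus.txt" and the hyperparameters below are hypothetical.
from transformers import (
    RobertaTokenizerFast,
    RobertaForMaskedLM,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)
from datasets import load_dataset

tokenizer = RobertaTokenizerFast.from_pretrained("roberta-base")
model = RobertaForMaskedLM.from_pretrained("roberta-base")

# Unlabeled in-domain text, one document per line (hypothetical path).
dataset = load_dataset("text", data_files={"train": "domain_corpus.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = dataset["train"].map(tokenize, batched=True, remove_columns=["text"])

# Dynamic token masking, as in standard RoBERTa masked-LM pretraining.
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

args = TrainingArguments(
    output_dir="roberta-dapt",
    per_device_train_batch_size=8,   # illustrative; see the paper for actual settings
    num_train_epochs=1,
    learning_rate=5e-5,
    save_steps=10_000,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized,
    data_collator=collator,
)
trainer.train()
# The adapted checkpoint saved under "roberta-dapt" is then fine-tuned on the
# labeled target task; task-adaptive pretraining (TAPT) follows the same recipe
# with the task's own unlabeled text in place of the domain corpus.
```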
doi:10.18653/v1/2020.acl-main.740
fatcat:2yk5xnaigjdo7bpy6n3l32ef44