A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit <a rel="external noopener" href="https://www.aclweb.org/anthology/2020.emnlp-main.496.pdf">the original URL</a>. The file type is <code>application/pdf</code>.
<i title="Association for Computational Linguistics">
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Large language models have recently achieved state of the art performance across a wide variety of natural language tasks. Meanwhile, the size of these models and their latency have significantly increased, which makes their usage costly, and raises an interesting question: do language models need to be large? We study this question through the lens of model compression. We present a generic, structured pruning approach by parameterizing each weight matrix using its low-rank factorization, and<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.18653/v1/2020.emnlp-main.496">doi:10.18653/v1/2020.emnlp-main.496</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/n4rj2e6carcy3kiuzm3rmv355m">fatcat:n4rj2e6carcy3kiuzm3rmv355m</a> </span>
more »... daptively removing rank-1 components during training. On language modeling tasks, our structured approach outperforms other unstructured and block-structured pruning baselines at various compression levels, while achieving significant speedups during both training and inference. We also demonstrate that our method can be applied to pruning adaptive word embeddings in large language models, and to pruning the BERT model on several downstream fine-tuning classification benchmarks. 1
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201213025907/https://www.aclweb.org/anthology/2020.emnlp-main.496.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/07/f1/07f1755f15b5ed1851ba4449dfeab18f1de59aab.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.18653/v1/2020.emnlp-main.496"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>