Towards Fully 8-bit Integer Inference for the Transformer Model
Ye Lin, Yanyang Li, Tengbo Liu, Tong Xiao, Tongran Liu, Jingbo Zhu
<span title="">2020</span>
<i title="International Joint Conferences on Artificial Intelligence Organization">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/vfwwmrihanevtjbbkti2kc3nke" style="color: black;">Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence</a>
</i>
., Softmax in Transformer), and make heavy use of quantization and de-quantization. ...
8-bit integer inference, as a promising direction in reducing both the latency and storage of deep neural networks, has made great progress recently. ...
Since both BERTScore and YiSi-1 use generalized precision, recall and F-score as the intrinsic metrics, while WMD, WMDo and MoverScore all use earth mover's distance as the intrinsic metric, this section ...
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.24963/ijcai.2020/516">doi:10.24963/ijcai.2020/516</a>
<a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/ijcai/ChenLXPMC20.html">dblp:conf/ijcai/ChenLXPMC20</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/meqdgmlcg5e7vc2mcodlu5hnjy">fatcat:meqdgmlcg5e7vc2mcodlu5hnjy</a>
</span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201106230818/https://www.ijcai.org/Proceedings/2020/0516.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/e0/92/e092977936bca9ac9491740dced4d69de6576e96.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.24963/ijcai.2020/516">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="external alternate icon"></i>
Publisher / doi.org
</button>
</a>