Hierarchical Multi-label Classification of Text with Capsule Networks

Rami Aly, Steffen Remus, Chris Biemann
2019, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, Association for Computational Linguistics
Capsule networks have been shown to perform well on structured data in the area of visual inference. In this paper we apply and compare simple shallow capsule networks for hierarchical multi-label text classification and show that they can outperform other neural networks, such as CNNs and LSTMs, as well as non-neural architectures such as SVMs. For our experiments, we use the established Web of Science (WOS) dataset and introduce a new real-world scenario dataset, the BlurbGenreCollection (BGC). Our results confirm the hypothesis that capsule networks are especially advantageous for rare events and structurally diverse categories, which we attribute to their ability to combine latent encoded information.
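
To make the task setting concrete, the following is a minimal sketch of how targets in hierarchical multi-label classification are commonly represented: each text may carry several labels, and a label implies all of its ancestors in the hierarchy. It is not the authors' code. The genre names, the `PARENT` map, and the helper functions `with_ancestors` and `multi_hot` are hypothetical, and the sketch assumes a tree-shaped hierarchy in which every label has at most one parent.

```python
from typing import Dict, List, Set

# Hypothetical child -> parent map (illustrative genre names,
# not taken from the BGC or WOS label sets).
PARENT: Dict[str, str] = {
    "Fantasy": "Fiction",
    "Mystery": "Fiction",
    "Epic Fantasy": "Fantasy",
    "History": "Nonfiction",
}


def with_ancestors(labels: Set[str]) -> Set[str]:
    """Return the given labels plus every ancestor reachable through PARENT."""
    closed = set(labels)
    for label in labels:
        node = label
        # Walk upward until a root (a label without a parent) is reached.
        while node in PARENT:
            node = PARENT[node]
            closed.add(node)
    return closed


def multi_hot(labels: Set[str], vocabulary: List[str]) -> List[int]:
    """Encode a label set as a 0/1 vector over a fixed label vocabulary."""
    return [1 if name in labels else 0 for name in vocabulary]


# Fixed, sorted vocabulary containing every label in the hierarchy.
VOCAB: List[str] = sorted(set(PARENT) | set(PARENT.values()))

if __name__ == "__main__":
    target = with_ancestors({"Epic Fantasy", "History"})
    print(VOCAB)
    print(multi_hot(target, VOCAB))
```

A classifier for this setting then predicts one score per vocabulary entry, so a single text can activate labels at several levels of the hierarchy at once.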
doi:10.18653/v1/p19-2045 (https://doi.org/10.18653/v1/p19-2045), dblp:conf/acl/AlyRB19, fatcat:jtrwrqmj4bhrjjlw76bx54jryy
