Incorporating Hierarchy into Text Encoder: a Contrastive Learning Approach for Hierarchical Text Classification [article]

Zihan Wang, Peiyi Wang, Lianzhe Huang, Xin Sun, Houfeng Wang
2022 arXiv   pre-print
Hierarchical text classification is a challenging subtask of multi-label classification due to its complex label hierarchy. Existing methods encode text and label hierarchy separately and mix their representations for classification, where the hierarchy remains unchanged for all input text. Instead of modeling them separately, in this work, we propose Hierarchy-guided Contrastive Learning (HGCLR) to directly embed the hierarchy into a text encoder. During training, HGCLR constructs positive
more » ... les for input text under the guidance of the label hierarchy. By pulling together the input text and its positive sample, the text encoder can learn to generate the hierarchy-aware text representation independently. Therefore, after training, the HGCLR enhanced text encoder can dispense with the redundant hierarchy. Extensive experiments on three benchmark datasets verify the effectiveness of HGCLR.
arXiv:2203.03825v2 fatcat:yqwfnql745eibowouhdxhi44hq