Microblog Retrieval Based on Concept-Enhanced Pre-Training Model

Yashen Wang, Zhaoyu Wang, Huanhuan Zhang, Zhirun Liu
2022 ACM Transactions on Knowledge Discovery from Data  
Despite substantial interest in applications of neural networks to information retrieval, neural ranking models have mostly been applied to conventional ad hoc retrieval tasks over web pages and newswire articles. This paper proposes a concept-enhanced pre-training model for microblog retrieval task, leveraging Semantic Matching Model (SMM) objective and Concept Correlation Model (CCM) objective. The proposed model is a novel neural ranking model specifically designed for ranking short-text
more » ... oblog, which could merge the advantage of pre-training methodology for generating valid contextualized embedding with the superiority of the prior lexical knowledge (e.g., concept knowledge) for understanding short-text language semantic. We conduct experiments on widely-used real-world datasets, and the experimental results demonstrate the efficiency of the proposed model, even compared with latest state-of-the-art neural-based models and pre-training based models.
doi:10.1145/3552311 fatcat:nvqy6v232fdwvjojxut7d34vs4