Statistical sense disambiguation with relatively small corpora using dictionary definitions

Alpha K. Luk
1995 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Corpus-based sense disambiguation methods, like most other statistical NLP approaches, suffer from the problem of data sparseness. In this paper, we describe an approach which overcomes this problem using dictionary definitions. Using the definition-based conceptual co-occurrence data collected from the relatively small Brown corpus, our sense disambiguation system achieves an average accuracy comparable to human performance given the same contextual information.
doi:10.3115/981658.981683 dblp:conf/acl/Luk95 fatcat:irlgtwnjrfh4rp4b2wndqotrmq