A quantitative evaluation of linguistic tests for the automatic prediction of semantic markedness

Vasileios Hatzivassiloglou, Kathleen McKeown
1995 Proceedings of the 33rd annual meeting on Association for Computational Linguistics -  
We present a corpus-based study of methods that have been proposed in the linguistics literature for selecting the semantically unmarked term out of a pair of antonymous adjectives. Solutions to this problem are applicable to the more general task of selecting the positive term from the pair. Using automatically collected data, the accuracy and applicability of each method is quantified, and a statistical analysis of the significance of the results is performed. We show that some simple methods
more » ... are indeed good indicators for the answer to the problem while other proposed methods fail to perform better than would be attributable to chance. In addition, one of the simplest methods, text frequency, dominates all others. We also apply two generic statistical learning methods for combining the indications of the individual methods, and compare their performance to the simple methods. The most sophisticated complex learning method offers a small, but statistically significant, improvement over the original tests.
doi:10.3115/981658.981685 dblp:conf/acl/HatzivassiloglouM95 fatcat:ftqqpiaxu5cjhcanibfjvskcku