Semiautomatic extension of CoreNet using a bootstrapping mechanism on corpus-based co-occurrences

Chris Biemann, Sa-Im Shin, Key-Sun Choi
2004 Proceedings of the 20th international conference on Computational Linguistics - COLING '04   unpublished
The paper describes a language-independent approach for semiautomatic extension of lexical-semantic word nets and evaluates the method on CoreNet, the Korean version of word net. In a bootstrapping fashion, the so-called 'Pendulum Algorithm' operates on word sets obtained by co-occurrence statistics on a large unannotated corpus and keeps error propagation low by a verification step. The results are not sufficient for automatic extension but provide a good candidate set. Further improvements are discussed.
doi:10.3115/1220355.1220533 fatcat:nb26iiolhndthmnrolbbgbd3nm