Dialogue act tagging with Transformation-Based Learning

Ken Samuel, Sandra Carberry, K. Vijay-Shanker
1998 Proceedings of the 36th annual meeting on Association for Computational Linguistics -  
For the task of recognizing dialogue acts, we are applying the Transformation-Based Learning (TBL) machine learning algorithm. To circumvent a sparse data problem, we extract values of well-motivated features of utterances, such as speaker direction, punctuation marks, and a new feature, called dialogue act cues, which we find to be more effective than cue phrases and word n-grams in practice. We present strategies for constructing a set of dialogue act cues automatically by minimizing the
more » ... py of the distribution of dialogue acts in a training corpus, filtering out irrelevant dialogue act cues, and clustering semantically-related words. In addition, to address limitations of TBL, we introduce a Monte Carlo strategy for training efficiently and a committee method for computing confidence measures. These ideas are combined in our working implementation, which labels held-out data as accurately as any other reported system for the dialogue act tagging task.
doi:10.3115/980691.980757 dblp:conf/acl/SamuelCV98 fatcat:dpewiu4eungvfnzuz5u2eoulna