A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
AUTOMATIC DIALOG ACT CORPUS CREATION FROM WEB PAGES
english
2010
Proceedings of the 12th International Conference on Enterprise Information Systems
unpublished
english
This work presents two complementary tools dedicated to the task of textual corpus creation for linguistic researches. The chosen application domain is automatic dialog acts recognition, but the proposed tools might also be applied to any other research area that is concerned with dialogs processing. The first software captures relevant dialogs from freely available resources on the World Wide Web. Filtering and parsing of these web pages is realized thanks to a set of hand-crafted rules. A
doi:10.5220/0003019501980203
fatcat:6rox3htmjzae5mengqqfrt7dx4