Treebanking User-Generated Content: A Proposal for a Unified Representation in Universal Dependencies
2020
International Conference on Language Resources and Evaluation
The paper discusses the main linguistic phenomena of user-generated texts found on the web and in social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework. Given, on the one hand, the increasing number of treebanks featuring user-generated content and, on the other, its somewhat inconsistent treatment in these resources, the aim of this paper is twofold: (1) to provide a short, though comprehensive, overview of such
dblp:conf/lrec/SanguinettiBCCC20
fatcat:w65hfde6oza4tom7c5htlkviny