Automatic Text Categorization in Terms of Genre and Author
2000
Computational Linguistics
The two main factors that characterize a text are its content and its style. Both can be used as a basis for categorization. In this paper we present an approach to text categorization in terms of genre and author for Modern Greek. In contrast to previous stylometric approaches, we attempt to take full advantage of existing natural language processing (NLP) tools. To this end, we propose a set of style markers including analysis-level measures that represent the way in which the input text has
doi:10.1162/089120100750105920
fatcat:ksreq6s6w5ewrgawtxvrbl5tje