A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Feature-finding for text classification
1996
Digital Scholarship in the Humanities
Every man's language has, first, its individualities; secondly, the common properties of the class to which he belongs; and thirdly, words and phrases of universal use." --Samuel Taylor Coleridge (1906 [1817]). Abstract Stylometrists have proposed and used a wide variety of textual features or markers, but until recently very little attention has been focused on the question: where do textual features come from? In many text-categorization tasks the choice of textual features is a crucial
doi:10.1093/llc/11.4.163
fatcat:vhgrwbkhvbbixkrtvx4yzhzffm