Cost-Sensitive Feature Extraction and Selection in Genre Classification
2009
Journal for Language Technology and Computational Linguistics
Automatic genre classification of Web pages is currently young compared to other Web classification tasks. Corpora are just starting to be collected and organized in a systematic way, feature extraction techniques are inconsistent and not well detailed, genres are constantly in dispute, and novel applications have not been implemented. This paper attempts to review and make progress in the area of feature extraction, an area that we believe can benefit all Web page classification, and genre
dblp:journals/ldvf/LeveringC09
fatcat:umba2miytzfrpbzw26olrokpcq