A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2015; you can also visit the original URL.
The file type is application/pdf
.
Enriched Format Text Categorization Using A Component Similarity Approach
2011
Journal of Software
Text categorization has been widely studied for years. However, conventional plain text categorization approaches which work good in plain text behave poor when they are simply applied to enriched format texts. An categorization approach that is applicable to enriched format text is proposed. During feature selection, we get feature structure distribution weight by using extended structure model so that structure affections to categorization are fully considered. Text formats are also taken
doi:10.4304/jsw.6.9.1713-1720
fatcat:rfvmdzqgnvd6fdnee56uyaqq4y