Evaluating the use of linguistic information in the pre-processing phase of Text Mining

C. Fagundes da Silva, O. Santos, R. Vieira
2006 Inteligencia Artificial  
This work proposes and evaluates the use of linguistic information in the pre-processing phase for text mining tasks applied to Portuguese texts. We present several experiments comparing our proposal to the usual techniques applied in the field. The results show that the use of linguistic information in the pre-processing phase brings some improvements for both text categorization and clustering.
doi:10.4114/ia.v9i26.845 fatcat:4ruvspc65ndazon4tlecdjdhym