A corpus analysis of simple account texts and the proposal of simplification strategies

Sandra M. Aluísio, Lucia Specia, Thiago A. S. Pardo, Erick G. Maziero, Helena M. Caseli, Renata P. M. Fortes
2008 Proceedings of the 26th annual ACM international conference on Design of communication - SIGDOC '08  
In this paper we investigate the main linguistic phenomena that can make texts complex and how they could be simplified. We focus on a corpus analysis of simple account texts available on the web for Brazilian Portuguese (BP). This study illustrates the need for text simplification to facilitate accessibility to information by poor readers and by people with cognitive disabilities. It also highlights features of simplification for BP, which may differ from other languages. Moreover, we propose [...] simplification strategies and a Simplification Annotation Editor. This study consists of the first step towards building BP text simplification systems. One of the scenarios in which these systems could be used is that of reading electronic texts produced, e.g., by the Brazilian government or by news agencies.
doi:10.1145/1456536.1456540 dblp:conf/sigdoc/AluisioSPMCF08 fatcat:rmad3lpry5ev3onjpiws3eiyva