Indexing of Reading Paths for a Structured Information Retrieval on the Web

Mathias Géry
2008 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology  
In this paper, we present a hyperdocument model taking into account the essential aspects of information on the Web: content, composition (logical structure) and nonlinear reading (hypertext structure). We have developed a Structured Information Retrieval System (SIRS) based on this model. Its phases of indexing and querying are based on a "reading paths" point of view of the Web: a Web site is considered as a set of potential reading paths, instead of a set of atomic and flat pages. We have
more » ... eloped an specific algorithm to index the reading paths. We present some experiments aiming at evaluating the interest of our indexing process of reading paths.
doi:10.1109/wiiat.2008.386 dblp:conf/webi/Gery08 fatcat:4b4dchshg5bufiz2mw44f62w4i