Multilingual and multimedia information retrieval from Web documents

M. Gatius, M. Bertran, H. Rodriguez
2004 Proceedings. 15th International Workshop on Database and Expert Systems Applications, 2004.  
Web documents present new challenges to conventional Information Retrieval (IR) technologies. This paper describes how these challenges are faced in FameIR, a multilingual multimedia IR shell. In this shell Cross-Language IR (CLIR) and query expansion are performed using EuroWordNet (EWN), the best developed and most widely used lexical resource for several languages. Techniques to extract information from Web documents, Wrapper Generation (WG) techniques, are used to access a finer information
more » ... granularity than the whole Web page. By combining IR and WG techniques with the use of EWN, FameIR provides a powerful facility to perform CLIR from multimedia Web documents.
doi:10.1109/dexa.2004.1333443 dblp:conf/dexaw/GatiusBR04 fatcat:fyij46nuxfaepp7pum6tna3eki