The Internet Archive has a preservation copy of this work in our general collections.
The file type is
Information Retrieval (IR) is an important application area of Natural Language Processing (NLP) where one encounters the genuine challenge of processing large quantities of unrestricted natural language text. While much effort has been made to apply NLP techniques to IR, very few NLP techniques have been evaluated on a document collection larger than several megabytes. Many NLP techniques are simply not efficient enough, and not robust enough, to handle a large amount of text. This paperarXiv:cmp-lg/9702009v1 fatcat:7fgaozpdqfdctext7rcdvtyhzu