One of the central knowledge sources of an information extraction system is a dictionary of linguistic patterns that can be used to identify the conceptual content of a text. This paper describes CRYSTAL, a system which automatically induces a dictionary of "concept-node definitions" sufficient to identify relevant information from a training corpus. Each of these concept-node definitions is generalized as far as possible without producing errors, so that a minimum number of dictionary entries

arXiv:cmp-lg/9505020v1 fatcat:3rzbbrvzcff2fdjw7ea62mv7mi