Application of the Variety-Generator Approach to Searches of Personal Names in Bibliographic Data Bases--Part 2. Optimization of Key-Sets, and Evaluation of Their Retrieval Efficiency

Dirk W. Fokker, Michael F. Lynch
1974 Information Technology and Libraries  
<p class="p1">Keys consisting of variable-length chamcter strings from the front and rear of surnames, derived by analysis of author names in a particular data base, am used to provide approximate representations of author names. When combined in appropriate ratios, and used together with keys for each of the first two initials of personal names, they provide a high degree of discrimination in search.</p> <p class="p1">Methods for optimization of key-sets are described, and the performance of
more » ... y-sets varying in size between <span class="s1">150 </span>and <span class="s1">300 </span>is determined at file sizes of up to <span class="s1">50,000 </span>name entries. The effects of varying the proportions of the queries present in the file are also examined. The results obtained with fixed-length keys are compared with those for variable-length keys, showing the latter to be greatly superior.</p> <p class="p1">Implications of the work for a variety of types of information systems are discussed.</p>
doi:10.6017/ital.v7i3.8951 fatcat:ulmo5e746vh4vjcnqppntwufqy