A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is
This paper explores the effectiveness of Juilland'sDas a measure of vocabulary dispersion in large corpora. Through a series of experiments using the BNC, we explored the influence of three variables: the number of corpus-parts used for the computation ofD, the frequency of the target word, and the distributions of those words. The experiments demonstrate that the effective range forDis greatly reduced when computations are based on a large number of corpus-parts: even words with highly skeweddoi:10.1075/ijcl.21.4.01bib fatcat:wwqzhpxyzjblzlgjqcb4f423y4