On the (non)utility of Juilland's D to measure lexical dispersion in large corpora
2016
International Journal of Corpus Linguistics
This paper explores the effectiveness of Juilland's D as a measure of vocabulary dispersion in large corpora. Through a series of experiments using the BNC, we explored the influence of three variables: the number of corpus-parts used for the computation of D, the frequency of the target word, and the distributions of those words. The experiments demonstrate that the effective range for D is greatly reduced when computations are based on a large number of corpus-parts: even words with highly skewed
doi:10.1075/ijcl.21.4.01bib
fatcat:wwqzhpxyzjblzlgjqcb4f423y4
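
As a rough illustration of the measure the abstract discusses, the sketch below computes Juilland's D using its commonly cited definition for equally sized corpus parts: D = 1 - V / sqrt(n - 1), where V is the coefficient of variation (population standard deviation divided by the mean) of a word's frequencies across n parts. The function name `juilland_d` and the toy frequency lists are assumptions made for this example. They are not taken from the paper, and the paper's own procedure (for instance, how unequal part sizes are handled) may differ.

```python
import math


def juilland_d(part_freqs):
    """Juilland's D for one word, given its frequency in each corpus part.

    Assumes all parts are the same size; for unequal parts, relative
    frequencies would have to be passed in instead (an assumption here).
    """
    n = len(part_freqs)
    if n < 2:
        raise ValueError("at least two corpus parts are required")
    mean = sum(part_freqs) / n
    if mean == 0:
        raise ValueError("the word does not occur in any part")
    # Population variance (divide by n), so that a word confined to a
    # single part yields V = sqrt(n - 1) and therefore D = 0.
    variance = sum((f - mean) ** 2 for f in part_freqs) / n
    v = math.sqrt(variance) / mean  # coefficient of variation
    return 1 - v / math.sqrt(n - 1)


# Toy data (hypothetical): the word occurs twice in every other part
# and never in the remaining half of the parts.
few_parts = [2, 0] * 5      # 10 corpus parts
many_parts = [2, 0] * 500   # 1000 corpus parts

d_few = juilland_d(few_parts)
d_many = juilland_d(many_parts)
print(f"n = 10:   D = {d_few:.3f}")
print(f"n = 1000: D = {d_many:.3f}")
```

With these toy inputs the word is absent from half of the parts in both cases, yet D rises from about 0.67 with 10 parts to about 0.97 with 1000 parts. This happens because the sqrt(n - 1) denominator grows with the number of parts while V stays at 1, which is in line with the abstract's point that the effective range of D shrinks as the number of corpus-parts increases.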