A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2018; you can also visit the original URL.
The file type is application/pdf
.
Worldlex: Twitter and blog word frequencies for 66 languages
2015
Behavior Research Methods
Lexical frequency is one of the strongest predictors of word processing time. The frequencies are often calculated from book-based corpora, or more recently from subtitlebased corpora. We present new frequencies based on Twitter, blog posts, or newspapers for 66 languages. We show that these frequencies predict lexical decision reaction times similar to the already existing frequencies, or even better than them. These new frequencies are freely available and may be downloaded from
doi:10.3758/s13428-015-0621-0
pmid:26170053
fatcat:mdumvhzvxzdx5iiv3riovxajma