Wiktionary talk:Frequency lists/PG/2005/10/1-1000
One entry in the Project Gutenberg frequency lists just can't be right: the article 'a' is listed so far down the list. Every other frequency list - whether the old "Million Word Corpus", the newer British National Corpus, academic lists of frequent words, or any of several others - list 'a' in 4th or, maybe, 5th place. I have never seen a frequency list (before this) that didn't have it in the top 10. Can this be a counting error from the program? I find it very hard to believe that 'a' comes in after 'an', much less outside the top 50.
- Matt -
"Gutenberg" is not a common word - mistake in script
"Gutenberg" appears in the 200-300 list. This must be a parsing error.