English 1-grams from Google's "English fiction (2008)" corpus that had no Wiktionary entry in any language, plus -in' words (see notes). Sorted by the number of volumes (books) in which each word appears between 1950 and 2008.
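A minimal sketch of how such a ranking could be produced. It assumes each data row carries a word, a year, a match count and a volume count, and that per-year volume counts are summed over the 1950 to 2008 range. The row layout, the function name rank_missing_words and the sample data are illustrative assumptions, not a record of how this list was actually built.

```python
from collections import defaultdict


def rank_missing_words(rows, known_words, first_year=1950, last_year=2008):
    """Rank words absent from known_words by total volume count in the year range.

    rows is an iterable of (word, year, match_count, volume_count) tuples;
    this layout is an assumption made for the sketch.
    """
    volumes = defaultdict(int)
    for word, year, _match_count, volume_count in rows:
        if first_year <= year <= last_year and word not in known_words:
            volumes[word] += volume_count
    # Highest volume count first; ties are broken alphabetically.
    return sorted(volumes.items(), key=lambda item: (-item[1], item[0]))


# Made-up sample rows: (word, year, match_count, volume_count)
sample_rows = [
    ("gendeman", 1960, 12, 9),
    ("gendeman", 1999, 5, 4),
    ("littie", 1975, 7, 6),
    ("littie", 1940, 3, 3),       # outside the year range, ignored
    ("little", 1975, 900, 300),   # already has an entry, ignored
]
sample_known = {"little"}  # stand-in for the set of existing entry titles

ranking = rank_missing_words(sample_rows, sample_known)
```

In practice known_words would be the full set of existing entry titles, and rows would be streamed from the corpus files rather than held in a list.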


  • The -in/-in' words are somewhat unreliable. lookin' appeared in the list as lookin (an artifact of the import process), so I've added the final apostrophe back to all words ending in -in. Some words listed as ending in -in' may therefore really end in -in. Many blue-linked -in' words (ones that already have an entry) are included too.
  • *gendeman = gentleman; the letter d often represents a badly OCRed tl.
  • I excluded words ending in -in or -dy from the original list because of the issues above, but they are included below.
  • hundreddollar = hundred-dollar. Compound words are often unambiguously hyphenated in the original text they were scanned from.
  • littie = little; lndian = Indian. The letters i, l and I are often confused by OCR.
  • retuming = returning; extremelv = extremely; c0uldn't = couldn't (rn read as m, y read as v, o read as 0).
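The OCR confusions in the notes above can be used to suggest the intended word for a suspect entry. The following is a rough sketch only: the substitution table is taken from the examples in the notes, while the function name candidate_corrections and the small known_words set are hypothetical. It undoes one confusion at one position at a time, so it will miss words with several errors and does not handle lost hyphens such as hundreddollar.

```python
# Substitutions drawn from the notes: (misread text, intended text).
OCR_CONFUSIONS = [
    ("d", "tl"),   # gendeman -> gentleman
    ("m", "rn"),   # retuming -> returning
    ("v", "y"),    # extremelv -> extremely
    ("0", "o"),    # c0uldn't -> couldn't
    ("i", "l"),    # littie -> little
    ("l", "I"),    # lndian -> Indian
    ("l", "i"),
    ("I", "l"),
]


def candidate_corrections(word, known_words):
    """Return known words reachable by undoing one OCR confusion at one position."""
    found = set()
    for wrong, right in OCR_CONFUSIONS:
        start = word.find(wrong)
        while start != -1:
            candidate = word[:start] + right + word[start + len(wrong):]
            if candidate in known_words:
                found.add(candidate)
            start = word.find(wrong, start + 1)
    return sorted(found)


# Hypothetical stand-in for a list of words that already have entries.
known_words = {"gentleman", "little", "Indian", "returning", "extremely", "couldn't"}

suggestions = {
    w: candidate_corrections(w, known_words)
    for w in ["gendeman", "littie", "lndian", "retuming", "extremelv", "c0uldn't"]
}
```

A word with no suggestions is not necessarily genuine, and a word with a suggestion is not necessarily a scanning error, so results like these would still need checking against the scanned text.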

top 400

next 400 (approx)

another 400 (approx)

another 1000

bottom of the top 5000

Another 5000

