User:AKA MBG/Statistics:POS
This page outlines:
- Number of meanings.
- Number of empty definitions for each language.
- Number of entries for each part of speech (POS).
The parsed database name: enwikt20111008_parsed[1]
See about Part of Speech (POS) headers: Wiktionary:Entry layout explained/POS headers
Total (all entries) [edit]
Number of words (with meanings) with unknown POS: 9161
The total of all unique noun, verb, etc. (+ with empty definitions): 1219090
Number of empty definitions: 61475
Number of words (unique noun, verb, etc.) with nonempty definitions: 1157615
Number of records in the table lang_pos: 1219090
Number of words having different number of meanings / definitions [edit]
Table description:
- column 0 - number of words with empty definitions (total and for each language)
- column 1 - number of monosemous words (total and for each language)
- column 2 - number of words with two meanings, etc.
- last column ("Total") - total number of words for this language.
Only the first 9 meanings (columns) are presented in the table.
| Number of meanings: | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | Total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| code | Total (all languages) : | 61475 | 955602 | 132279 | 40586 | 14362 | 5575 | 2894 | 1600 | 958 | 636 | 1215967 |
| ext | Extremaduran | 0 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| lbe | Lak | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| yut | Yopno | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| liv | Livonian | 1 | 140 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 141 |
| pih | Pitcairn-Norfolk | 0 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| xfa | Faliscan | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| bew | Betawi | 0 | 29 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 32 |
| fy | West Frisian | 5 | 810 | 116 | 24 | 3 | 2 | 0 | 0 | 0 | 0 | 960 |
| obm | Moabite | 0 | 18 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| awa | Awadhi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| syc | Syriac | 49 | 495 | 203 | 137 | 65 | 52 | 26 | 15 | 9 | 7 | 1058 |
| dz | Dzongkha | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| ksd | Tolai | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| brc | Berbice Creole Dutch | 0 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| www | Wawa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ny | Chewa | 0 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| hit | Hittite | 0 | 52 | 12 | 6 | 3 | 1 | 2 | 0 | 1 | 0 | 77 |
| ppm | Papuma | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| aus-bun | Bunurong | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ckt | Chukchi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| arw | Arawak | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| dng | Dungan | 0 | 4 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| tgl | Tagalog | 5 | 676 | 107 | 25 | 6 | 0 | 0 | 0 | 0 | 0 | 819 |
| kayah | Kayah | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tmh | Tamashaq | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| waq | Wagiman | 0 | 4 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 6 |
| ty | Tahitian | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| del | Delaware | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| agx | Aghul | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| egy | Egyptian | 0 | 170 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 181 |
| mo | Moldovan | 0 | 26 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 32 |
| chg | Chagatai | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| arg | Aragonese | 1 | 121 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 125 |
| hai | Haida | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mul | Translingual | 559 | 28273 | 5082 | 1610 | 392 | 71 | 28 | 7 | 6 | 3 | 36031 |
| mrc | Maricopa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| stq | Saterland Frisian | 1 | 112 | 15 | 1 | 3 | 0 | 0 | 0 | 0 | 0 | 132 |
| bku | Buhid | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mwl | Mirandese | 0 | 22 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| ne | Nepali | 0 | 35 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 41 |
| kgg | Kusunda | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| nxe | Nage | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| arc | Aramaic | 7 | 1394 | 372 | 91 | 28 | 8 | 1 | 0 | 0 | 0 | 1901 |
| mps | Dadibi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xhu | Hurrian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ktn | Karitiâna | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| mfe | Mauritian Creole | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| crp-gep | Greenlandic Eskimo Pidgin | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| cu | Old Church Slavonic | 1 | 1830 | 312 | 72 | 12 | 0 | 0 | 1 | 0 | 0 | 2228 |
| or | Oriya | 0 | 17 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 17 |
| ms | Malay | 1 | 377 | 34 | 6 | 2 | 0 | 0 | 0 | 0 | 0 | 420 |
| cr | Cree | 0 | 32 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 33 |
| din | Dinka | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| bzd | Bribri | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| osp | Old Spanish | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| xdc | Dacian | 0 | 39 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 39 |
| tpn | Tupinambá | 0 | 11 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 14 |
| adt | Adnyamathanha | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kac | Jingpho | 0 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
| gul | Gullah | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| arp | Arapaho | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| kjg | Khmu | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ady | Adyghe | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| el | Greek | 65 | 12466 | 2931 | 4403 | 772 | 83 | 93 | 7 | 3 | 0 | 20823 |
| xsr | Sherpa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ist | Istriot | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| wgy | Warrgamay | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mus | Creek | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| sem-amm | Ammonite | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| tcy | Tulu | 0 | 110 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 116 |
| mer | Meru | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| run | Rundi | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| tso | Tsonga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sr | Serbian | 0 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
| ryu | Okinawan | 2 | 65 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 67 |
| khv | Khwarshi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| seu | Serui-Laut | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| lo | Lao | 0 | 526 | 82 | 39 | 14 | 5 | 1 | 1 | 1 | 0 | 669 |
| rm | Romansch | 21 | 1004 | 42 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 1071 |
| xve | Venetic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| nv | Navajo | 2 | 938 | 117 | 41 | 13 | 3 | 0 | 0 | 0 | 0 | 1114 |
| alu | 'Are'are | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ndh | Ndali | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xno | Anglo-Norman | 128 | 1383 | 159 | 24 | 6 | 0 | 0 | 0 | 0 | 0 | 1700 |
| sc | Sardinian | 0 | 123 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 128 |
| kca | Khanty | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tfn | Dena'ina | 0 | 21 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| ksi | I'saka | 0 | 20 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
| kju | Kashaya | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| th | Thai | 5 | 2228 | 228 | 73 | 21 | 1 | 5 | 1 | 1 | 0 | 2563 |
| ht | Haitian Creole | 2 | 708 | 32 | 6 | 2 | 1 | 1 | 0 | 0 | 0 | 752 |
| moe | Innu-aimun | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| pms | Piedmontese | 0 | 22 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| fil | Filipino | 0 | 276 | 32 | 12 | 3 | 0 | 0 | 0 | 0 | 0 | 323 |
| gnd | Zulgo-Gemzek | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| aar | Afar | 0 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| rtm | Rotuman | 0 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| dbl | Dyirbal | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| km | Khmer | 1 | 555 | 63 | 31 | 18 | 8 | 6 | 3 | 1 | 0 | 686 |
| wba | Warao | 0 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| wew | Weyewa | 0 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| ajp | South Levantine Arabic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kv | Komi | 0 | 32 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| bh | Bihari | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| cs | Czech | 79 | 16784 | 1685 | 284 | 111 | 33 | 6 | 5 | 0 | 3 | 18990 |
| lzh | Classical Chinese | 0 | 4 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 6 |
| phn | Phoenician | 0 | 101 | 6 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 109 |
| cpi | Chinese Pidgin English | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mrv | Mangareva | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| omn | Minoan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| pag | Pangasinan | 0 | 9 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| uz | Uzbek | 0 | 201 | 38 | 12 | 1 | 1 | 0 | 0 | 0 | 0 | 253 |
| den | Slavey | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| duj | Dhuwal | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| noo | Nuu-chah-nulth | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| got | Gothic | 0 | 294 | 29 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 329 |
| sn | Shona | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| kr | Kanuri | 0 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| ug | Uyghur | 0 | 189 | 12 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 203 |
| mhk | Mungaka | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ha | Hausa | 0 | 8 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 9 |
| pnw | Panyjima | 0 | 14 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| tkl | Tokelauan | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| gml | Middle Low German | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| rej | Rejang | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gaa | Ga | 0 | 18 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 18 |
| kab | Kabyle | 0 | 52 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 53 |
| guz | Gusii | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xls | Lusitanian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ymm | Maay | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tr | Turkish | 36 | 5842 | 813 | 134 | 67 | 22 | 14 | 11 | 2 | 0 | 6941 |
| ctu | Chol | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| gl | Galician | 730 | 2668 | 337 | 47 | 20 | 4 | 2 | 0 | 0 | 0 | 3808 |
| sqt | Soqotri | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mr | Marathi | 0 | 113 | 12 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 128 |
| abe | Abenaki | 1 | 39 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
| iu | Inuktitut | 2 | 171 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 176 |
| uk | Ukrainian | 2 | 634 | 62 | 14 | 5 | 4 | 1 | 0 | 0 | 0 | 722 |
| luy | Luhya | 0 | 35 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| cia | Cia-Cia | 0 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 52 |
| krc | Karachay-Balkar | 0 | 59 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 61 |
| wbw | Woi | 0 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| hmn | Hmong | 1 | 47 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50 |
| ccc | Chamicuro | 0 | 440 | 11 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 453 |
| ce | Chechen | 0 | 126 | 11 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 142 |
| mvr | Marau | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| st | Sotho | 0 | 27 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 |
| nys | Nyunga | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| aus-gab | Gabi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| caa | Ch'orti' | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| fit | Meänkieli | 0 | 10 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| mzn | Mazandarani | 0 | 27 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 29 |
| vi | Vietnamese | 276 | 6859 | 235 | 43 | 14 | 8 | 0 | 0 | 0 | 0 | 7435 |
| goh | Old High German | 1 | 831 | 69 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 908 |
| nia | Nias | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| bg | Bulgarian | 225 | 5390 | 525 | 238 | 80 | 47 | 30 | 24 | 2 | 6 | 6567 |
| haw | Hawaiian | 0 | 464 | 153 | 52 | 5 | 0 | 0 | 0 | 0 | 0 | 674 |
| apm | Chiricahua | 1 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| nds-nl | Dutch Low Saxon | 0 | 32 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| pmt | Tuamotuan | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ach | Acholi | 0 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
| nog | Nogai | 0 | 11 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| pot | Potawatomi | 0 | 11 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| sl | Slovene | 7 | 2998 | 192 | 66 | 14 | 9 | 6 | 0 | 0 | 10 | 3302 |
| frr | North Frisian | 1 | 41 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 44 |
| eo | Esperanto | 844 | 9904 | 691 | 79 | 9 | 0 | 1 | 0 | 0 | 0 | 11528 |
| sv | Swedish | 2708 | 14218 | 1889 | 491 | 215 | 82 | 26 | 20 | 17 | 3 | 19669 |
| adz | Adzera | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ca | Catalan | 1259 | 6143 | 674 | 200 | 57 | 26 | 4 | 2 | 3 | 3 | 8371 |
| zav | Yatzachi Zapotec | 0 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| mixe | Mixe | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| smn | Inari Sami | 0 | 128 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 131 |
| yap | Yapese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kg | Kongo | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| nl | Dutch | 2756 | 18190 | 3083 | 784 | 220 | 77 | 32 | 17 | 10 | 4 | 25173 |
| io | Ido | 325 | 2256 | 108 | 12 | 4 | 2 | 0 | 0 | 0 | 0 | 2707 |
| arl | Arabela | 0 | 36 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
| frp | Franco-Provençal | 0 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| fj | Fijian | 0 | 135 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 146 |
| kw | Cornish | 8 | 593 | 41 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 647 |
| gbb | Kaytetye | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| vec | Venetian | 201 | 1682 | 490 | 77 | 7 | 4 | 0 | 0 | 0 | 0 | 2461 |
| cst | Northern Ohlone | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| dlg | Dolgan | 0 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| peo | Old Persian | 0 | 65 | 9 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 77 |
| ave | Avestan | 0 | 6 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 7 |
| tzm | Central Morocco Tamazight | 0 | 46 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 48 |
| ky | Kyrgyz | 0 | 142 | 9 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 153 |
| sah | Sakha | 0 | 54 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
| fo | Faroese | 30 | 2774 | 486 | 132 | 62 | 21 | 15 | 2 | 2 | 1 | 3525 |
| ie | Occidental | 0 | 11 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| cpe-spp | Samoan Plantation Pidgin | 0 | 19 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| nds | Low Saxon | 23 | 137 | 29 | 9 | 3 | 2 | 0 | 0 | 0 | 1 | 204 |
| ik | Inupiaq | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| tar | Tarahumara | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| aka | Akan | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| frk | Frankish | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| aeb | Tunisian Arabic | 0 | 17 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 19 |
| ru | Russian | 928 | 11537 | 2538 | 954 | 401 | 156 | 78 | 35 | 12 | 2 | 16641 |
| bcl | Bikol Central | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| xvs | Vestinian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| nn | Norwegian Nynorsk | 66 | 2440 | 151 | 27 | 7 | 3 | 3 | 1 | 0 | 0 | 2698 |
| pam | Kapampangan | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| jiv | Shuar | 0 | 7 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| br | Breton | 9 | 827 | 48 | 9 | 1 | 1 | 0 | 0 | 0 | 0 | 895 |
| mh | Marshallese | 1 | 184 | 9 | 6 | 5 | 0 | 0 | 0 | 0 | 1 | 206 |
| zu | Zulu | 0 | 33 | 2 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
| so | Somali | 0 | 56 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 58 |
| tpw | Old Tupi | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| moh | Mohawk | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ki | Gikuyu | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| rw | Kinyarwanda | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| ary | Moroccan Arabic | 0 | 33 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| solresol | Solresol | 0 | 28 | 7 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 37 |
| hu | Hungarian | 63 | 19691 | 1501 | 319 | 87 | 33 | 12 | 2 | 2 | 0 | 21710 |
| rue | Rusyn | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| yij | Yindjibarndi | 0 | 21 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 25 |
| ps | Pashto | 1 | 345 | 33 | 6 | 4 | 0 | 0 | 0 | 0 | 0 | 389 |
| vol | Volapük | 102 | 1672 | 130 | 28 | 15 | 4 | 1 | 0 | 0 | 0 | 1952 |
| pau | Palauan | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| mas | Maasai | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sov | Sonsorolese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| pro | Old Occitan | 14 | 196 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 219 |
| da | Danish | 1570 | 4598 | 1067 | 426 | 185 | 87 | 51 | 30 | 8 | 4 | 8026 |
| nod | Northern Thai | 0 | 36 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
| mey | Hassānīya | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| vai | Vai | 5 | 308 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 315 |
| ntj | Ngaanyatjarra | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| aak | Ankave | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| hop | Hopi | 0 | 8 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 9 |
| wa | Walloon | 0 | 60 | 3 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 65 |
| fud | Futunan | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| com | Comanche | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| et | Estonian | 25 | 2982 | 201 | 27 | 8 | 3 | 1 | 0 | 1 | 0 | 3248 |
| are | Arrernte | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| pjt | Pitjantjatjara | 0 | 129 | 32 | 11 | 2 | 0 | 0 | 0 | 0 | 0 | 174 |
| uga | Ugaritic | 0 | 151 | 8 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 161 |
| crp-tpr | Taimyr Pidgin Russian | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| yo | Yoruba | 0 | 113 | 4 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 119 |
| adj | Adioukrou | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| vot | Votic | 0 | 73 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 74 |
| kaz | Kazakh | 0 | 166 | 7 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 174 |
| ltg | Latgalian | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| sat | Santali | 0 | 78 | 7 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 90 |
| bjn | Banjarese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tay | Atayal | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| cho | Choctaw | 0 | 81 | 22 | 7 | 6 | 2 | 0 | 0 | 0 | 0 | 118 |
| yai | Yaghnobi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tk | Turkmen | 0 | 283 | 15 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 301 |
| bug | Buginese | 0 | 23 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
| rap | Rapa Nui | 0 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| xpo | Pochutec | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ko | Korean | 195 | 14197 | 776 | 178 | 61 | 31 | 13 | 3 | 7 | 9 | 15470 |
| os | Ossetian | 0 | 146 | 18 | 9 | 0 | 0 | 1 | 0 | 0 | 0 | 174 |
| id_ | Indonesian | 4 | 1521 | 176 | 51 | 10 | 2 | 3 | 2 | 0 | 2 | 1771 |
| myv | Erzya | 0 | 56 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 64 |
| lkt | Lakota | 1 | 27 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 31 |
| de | German | 639 | 22328 | 2631 | 599 | 240 | 91 | 45 | 16 | 3 | 2 | 26594 |
| gay | Gayo | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| tgt | Tagbanwa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| lad | Judaeo-Spanish | 19 | 944 | 43 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 1009 |
| wym | Vilamovian | 0 | 336 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 344 |
| rme | Angloromani | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| dsb | Lower Sorbian | 0 | 194 | 9 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 205 |
| gvf | Golin | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| dak | Dakota | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| bpl | Broome Pearling Lugger Pidgin | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mvi | Miyako | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| no | Norwegian | 110 | 6024 | 470 | 93 | 39 | 9 | 3 | 0 | 0 | 0 | 6748 |
| shi | Shilha | 0 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| sux | Sumerian | 0 | 96 | 16 | 7 | 2 | 2 | 0 | 0 | 0 | 0 | 123 |
| ch | Chamorro | 0 | 84 | 10 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 96 |
| ade | Adele | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| uln | Unserdeutsch | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| meu | Motu | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| hsb | Upper Sorbian | 0 | 153 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 160 |
| en | English | 5418 | 224489 | 33973 | 9842 | 4014 | 1787 | 989 | 615 | 359 | 236 | 281722 |
| ban | Balinese | 0 | 34 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 44 |
| chn | Chinook Jargon | 0 | 65 | 12 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 79 |
| ofs | Old Frisian | 0 | 124 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 126 |
| zkz | Khazar | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| yii | Yidiny | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| blt | Tai Dam | 0 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| dbj | Ida'an | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xdm | Edomite | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| srn | Sranan Tongo | 0 | 280 | 29 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 311 |
| ff | Fula | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| pld | Polari | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| myp | Pirahã | 0 | 9 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| bhw | Biak | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| sa | Sanskrit | 14 | 834 | 344 | 221 | 146 | 101 | 89 | 54 | 54 | 26 | 1883 |
| gdm | Laal | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| nb | Bokmål | 8 | 744 | 77 | 22 | 14 | 5 | 3 | 1 | 1 | 0 | 875 |
| gan | Gan | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| fur | Friulian | 0 | 33 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| aiw | Aari | 0 | 17 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 17 |
| itl | Itelmen | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| brx | Bodo | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mn | Mongolian | 1 | 251 | 4 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 258 |
| ood | O'odham | 0 | 9 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| kea | Kabuverdianu | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| tum | Tumbuka | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| rom | Romani | 9 | 355 | 32 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 399 |
| rup | Aromanian | 1 | 83 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 88 |
| fkv | Kven | 0 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| gsw | Swiss German | 0 | 45 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
| to | Tongan | 0 | 23 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
| lv | Latvian | 13 | 1123 | 163 | 12 | 5 | 0 | 0 | 0 | 0 | 0 | 1316 |
| spx | South Picene | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| aus-dar | Darkinjung | 0 | 88 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 89 |
| gu | Gujarati | 0 | 82 | 11 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 96 |
| iii | Nuosu | 0 | 50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50 |
| pa | Punjabi | 0 | 82 | 6 | 3 | 0 | 3 | 0 | 0 | 0 | 0 | 94 |
| mpm | Yosondúa Mixtec | 0 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| kn | Kannada | 0 | 302 | 19 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 324 |
| sjd | Kildin Sami | 0 | 10 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| gni | Gooniyandi | 0 | 56 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 57 |
| cv | Chuvash | 0 | 23 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
| sei | Seri | 0 | 56 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 62 |
| abk | Abkhaz | 0 | 42 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
| esu | Central Alaskan Yup'ik | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| ace | Acehnese | 0 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| amk | Ambai | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| xto | Tocharian | 0 | 115 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 128 |
| kal | Greenlandic | 3 | 611 | 60 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 678 |
| har | Harari | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| roo | Rotokas | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| gmy | Mycenaean Greek | 0 | 100 | 14 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 116 |
| mod | Mobilian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| abq | Abaza | 0 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| ruo | Istro-Romanian | 1 | 70 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 72 |
| xss | Assan | 0 | 17 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| cab | Garifuna | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| fla | Kalispel-Pend d'Oreille | 0 | 130 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 137 |
| arn | Mapudungun | 0 | 513 | 252 | 59 | 24 | 3 | 2 | 0 | 0 | 0 | 853 |
| Chumashan | Chumashan | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| tab | Tabassaran | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| zpq | Zoogocho Zapotec | 0 | 10 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| apy | Apalaí | 0 | 10 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| chy | Cheyenne | 0 | 19 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| akk | Akkadian | 0 | 88 | 19 | 8 | 5 | 2 | 1 | 0 | 1 | 0 | 124 |
| lez | Lezgian | 0 | 47 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 47 |
| ur | Urdu | 1 | 835 | 344 | 206 | 134 | 85 | 37 | 34 | 18 | 10 | 1704 |
| ulk | Meriam | 0 | 41 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
| akz | Alabama | 0 | 56 | 10 | 3 | 1 | 1 | 0 | 0 | 0 | 0 | 71 |
| kky | Guugu Yimithirr | 1 | 89 | 6 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 99 |
| alt | Altai | 0 | 47 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 47 |
| gd | Scottish Gaelic | 243 | 6182 | 1174 | 393 | 182 | 71 | 32 | 21 | 10 | 5 | 8313 |
| saw | Sawi | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| uby | Ubykh | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| hil | Hiligaynon | 3 | 1546 | 109 | 15 | 1 | 0 | 0 | 0 | 0 | 0 | 1674 |
| lb | Luxembourgish | 60 | 2962 | 394 | 73 | 17 | 3 | 1 | 0 | 0 | 0 | 3510 |
| xlc | Lycian | 0 | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
| xvo | Volscian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| bou | Bondei | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tcs | Torres Strait Creole | 0 | 106 | 7 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 116 |
| bm | Bambara | 0 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| gn | Guaraní | 0 | 59 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 62 |
| wrp | Waropen | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| cow | Cowlitz | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tig | Tigre | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| osc | Oscan | 0 | 10 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| bla | Blackfoot | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| xal | Kalmyk | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| wad | Wandamen | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| na | Nauruan | 0 | 13 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| pcm | Nigerian Pidgin | 0 | 59 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 62 |
| khw | Khowar | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| wrh | Wiradjuri | 5 | 147 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 162 |
| cjs | Shor | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| xmf | Mingrelian | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| smj | Lule Sami | 0 | 10 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| mi | Maori | 0 | 283 | 36 | 6 | 2 | 0 | 0 | 0 | 0 | 0 | 327 |
| cic | Chickasaw | 0 | 335 | 45 | 9 | 1 | 0 | 0 | 1 | 0 | 0 | 391 |
| bho | Bhojpuri | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| hi | Hindi | 1 | 1697 | 375 | 206 | 136 | 60 | 41 | 23 | 12 | 6 | 2557 |
| mnc | Manchu | 0 | 32 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 32 |
| bpy | Bishnupriya Manipuri | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| new | Newari | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| gil | Gilbertese | 0 | 15 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 18 |
| ml | Malayalam | 2 | 115 | 6 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 124 |
| gld | Nanai | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| kjb | Q'anjob'al | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| lif | Limbu | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| grc | Ancient Greek | 54 | 3078 | 595 | 354 | 202 | 124 | 67 | 46 | 25 | 10 | 4555 |
| roa-gal | Gallo | 0 | 49 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 49 |
| xld | Lydian | 0 | 34 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
| mns | Mansi | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| chm | Mari | 0 | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
| nrn | Norn | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ba | Bashkir | 0 | 140 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 145 |
| khb | Tai Lü | 0 | 8 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| nio | Nganasan | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| min | Minangkabau | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| men | Mende | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| my | Burmese | 0 | 138 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 146 |
| ina | Interlingua | 162 | 825 | 57 | 11 | 4 | 0 | 0 | 0 | 0 | 0 | 1059 |
| stp | Tepehuán | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| nap | Neapolitan | 4 | 377 | 79 | 17 | 7 | 0 | 0 | 1 | 0 | 0 | 485 |
| cop | Coptic | 0 | 11 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| pad | Paumarí | 0 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| wam | Massachusett | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tvl | Tuvaluan | 0 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| co | Corsican | 2 | 79 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 81 |
| gag | Gagauz | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
| tsg | Tausug | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| zkt | Khitan | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| evn | Evenki | 0 | 42 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 45 |
| frm | Middle French | 50 | 1153 | 68 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 1274 |
| mwf | Murrinh-Patha | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| bal | Balochi | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| tpi | Tok Pisin | 1 | 305 | 42 | 11 | 0 | 2 | 1 | 0 | 0 | 0 | 362 |
| css | Southern Ohlone | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| afr | Afrikaans | 17 | 601 | 65 | 11 | 1 | 2 | 1 | 0 | 0 | 0 | 698 |
| kda | Worimi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| wlm | Middle Welsh | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| tet | Tetum | 0 | 64 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 71 |
| sgz | Sursurunga | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| umb | Umbundu | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kpy | Koryak | 0 | 42 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
| vin | Vinza | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| yue | Cantonese | 36 | 12463 | 99 | 38 | 26 | 6 | 19 | 9 | 9 | 6 | 12711 |
| prg | Old Prussian | 0 | 224 | 25 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 250 |
| ksh | Kölsch | 0 | 48 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50 |
| drl | Darling | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| shn | Shan | 0 | 28 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 29 |
| crg | Michif | 0 | 24 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| aib | Äynu | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| qu | Quechua | 2 | 111 | 9 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 125 |
| cax | Chiquitano | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| yi | Yiddish | 4 | 698 | 65 | 15 | 3 | 1 | 0 | 1 | 0 | 0 | 787 |
| nha | Nhanda | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| kaa | Karakalpak | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| mg | Malagasy | 6 | 65 | 15 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 90 |
| mad | Madurese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| bjz | Baruga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| pgl | Primitive Irish | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| dar | Dargwa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kpg | Kapingamarangi | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| oc | Occitan | 20 | 927 | 109 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 1062 |
| ilo | Ilokano | 0 | 25 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
| ja | Japanese | 910 | 41173 | 4355 | 1507 | 633 | 356 | 170 | 133 | 76 | 56 | 49369 |
| mnw | Mon | 0 | 10 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| smo | Samoan | 0 | 24 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
| krl | Karelian | 0 | 104 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 104 |
| str | Saanich | 0 | 196 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 206 |
| sbf | Shabo | 0 | 65 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 68 |
| bar | Bavarian | 0 | 19 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| nmn | Taa | 2 | 238 | 45 | 20 | 6 | 0 | 0 | 1 | 0 | 0 | 312 |
| zh-cn | Simplified Chinese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tzj | Tz'utujil | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| niu | Niuean | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ess | Central Siberian Yupik | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| and | Ansus | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| pl | Polish | 357 | 8425 | 935 | 193 | 93 | 23 | 5 | 2 | 6 | 1 | 10040 |
| lld | Ladin | 0 | 33 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
| hif | Fiji Hindi | 0 | 112 | 3 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 117 |
| osx | Old Saxon | 0 | 70 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 78 |
| dlm | Dalmatian | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| lre | Laurentian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| za | Zhuang | 0 | 32 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| ett | Etruscan | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| mnk | Mandinka | 0 | 40 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 47 |
| tna | Tacana | 0 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| ku | Kurdish | 0 | 1016 | 42 | 8 | 0 | 8 | 0 | 0 | 0 | 0 | 1074 |
| abm | Abanyom | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| apw | Western Apache | 0 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| zun | Zuni | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| oj | Ojibwe | 0 | 145 | 39 | 15 | 5 | 2 | 0 | 0 | 0 | 0 | 206 |
| xpu | Punic | 0 | 24 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
| xmk | Ancient Macedonian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gut | Maléku | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| kjr | Kurudu | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| li | Limburgish | 1 | 276 | 138 | 11 | 6 | 2 | 0 | 0 | 0 | 0 | 434 |
| rar | Rarotongan | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| xcl | Classical Armenian | 81 | 2025 | 669 | 304 | 143 | 64 | 36 | 17 | 7 | 7 | 3353 |
| jao | Yanyuwa | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| hnd | Hindko | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| quc | K'iche' | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| mjg | Monguor | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| sth | Shelta | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| aus-syd | Sydney | 0 | 24 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| dv | Dhivehi | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| raj | Rajasthani | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| gez | Ge'ez | 0 | 18 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| fon | Fon | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| zmb | Zimba | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xeb | Eblaite | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
| cuk | Kuna | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| fan | Fang | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ara | Arabic | 30 | 1833 | 441 | 235 | 131 | 85 | 51 | 36 | 40 | 27 | 2909 |
| pt | Portuguese | 937 | 6939 | 698 | 166 | 61 | 15 | 5 | 2 | 2 | 1 | 8826 |
| mop | Mopan | 0 | 26 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
| gmh | Middle High German | 0 | 27 | 3 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| osa | Osage | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| bn | Bengali | 1 | 1582 | 115 | 53 | 9 | 2 | 0 | 0 | 0 | 0 | 1762 |
| sh | Serbo-Croatian | 58 | 30975 | 5791 | 1807 | 553 | 216 | 98 | 42 | 18 | 24 | 39582 |
| bft | Balti | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| ln | Lingala | 0 | 63 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 66 |
| xho | Xhosa | 0 | 30 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
| fuc | Pulaar | 0 | 28 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31 |
| ota | Ottoman Turkish | 0 | 344 | 152 | 81 | 32 | 23 | 7 | 4 | 5 | 0 | 648 |
| lmo | Lombard | 0 | 31 | 4 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
| ibo | Igbo | 1 | 21 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
| sjt | Ter Sami | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| ve | Venda | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ss | Swati | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| gwc | Kalami | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kdd | Yankunytjatjara | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| shh | Shoshone | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| ani | Andi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| nan | Min Nan | 35 | 2105 | 172 | 25 | 7 | 1 | 0 | 0 | 0 | 0 | 2345 |
| cdo | Min Dong | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| pi | Pali | 3 | 39 | 2 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 51 |
| izh | Ingrian | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| vma | Martuthunira | 0 | 101 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 109 |
| mrh | Mara Chin | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| abs | Ambonese | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| xvn | Vandalic | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| si | Sinhala | 0 | 76 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 76 |
| ho | Hiri Motu | 0 | 35 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| brg | Baure | 0 | 17 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 18 |
| zmg | Marti Ke | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| lou | Louisiana Creole French | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| ro | Romanian | 185 | 7285 | 2288 | 408 | 433 | 24 | 18 | 4 | 4 | 1 | 10650 |
| doz | Dorze | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ga | Irish | 291 | 3767 | 666 | 235 | 101 | 50 | 18 | 9 | 3 | 1 | 5141 |
| te | Telugu | 0 | 435 | 12 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 449 |
| bo | Tibetan | 0 | 160 | 15 | 4 | 1 | 0 | 0 | 1 | 0 | 0 | 181 |
| szl | Silesian | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| nay | Ngarrindjeri | 0 | 141 | 8 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 152 |
| nov | Novial | 0 | 141 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 143 |
| juc | Jurchen | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| xav | Xavante | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| pcd | Picard | 0 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| chc | Catawba | 0 | 13 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| fi | Finnish | 3496 | 49478 | 4863 | 1085 | 364 | 137 | 66 | 30 | 16 | 15 | 59550 |
| vmw | Makhuwa | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| ake | Akawaio | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| tir | Tigrinya | 0 | 238 | 12 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 253 |
| xpm | Pumpokol | 0 | 47 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 47 |
| la | Latin | 11275 | 13487 | 5684 | 2188 | 897 | 320 | 130 | 73 | 28 | 8 | 34090 |
| tt | Tatar | 0 | 542 | 26 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 572 |
| sga | Old Irish | 107 | 554 | 201 | 85 | 23 | 19 | 8 | 2 | 2 | 2 | 1003 |
| zku | Kaurna | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| crh | Crimean Tatar | 1 | 1935 | 209 | 34 | 6 | 0 | 0 | 0 | 0 | 0 | 2185 |
| oma | Omaha-Ponca | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| se | Northern Sami | 1 | 683 | 37 | 3 | 2 | 0 | 0 | 0 | 0 | 0 | 726 |
| ple | Palu'e | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| miq | Miskito | 0 | 13 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| sco | Scots | 67 | 1212 | 154 | 40 | 17 | 5 | 2 | 2 | 0 | 0 | 1499 |
| ali | Amaimon | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| agg | Angor | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kam | Kamba | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| cy | Welsh | 10 | 1261 | 153 | 30 | 7 | 2 | 0 | 0 | 1 | 1 | 1465 |
| kud | 'Auhelawa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| aln | Gheg | 0 | 9 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 11 |
| ayl | Libyan Arabic | 0 | 121 | 36 | 2 | 1 | 1 | 0 | 0 | 0 | 0 | 161 |
| xtg | Gaulish | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| av | Avar | 0 | 10 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| srq | Sirionó | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| tiw | Tiwi | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| orv | Old East Slavic | 0 | 7 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| twf | Taos | 0 | 491 | 23 | 6 | 2 | 2 | 0 | 0 | 0 | 0 | 524 |
| es | Spanish | 6194 | 23070 | 3577 | 1002 | 355 | 117 | 65 | 25 | 17 | 7 | 34429 |
| wiv | Vitu | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| ks | Kashmiri | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| pon | Pohnpeian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| roa-nor | Norman | 1 | 79 | 5 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 86 |
| awk | Awabakal | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| it | Italian | 11315 | 87586 | 12726 | 2997 | 715 | 206 | 84 | 22 | 10 | 7 | 115668 |
| jv | Javanese | 0 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| ssb | Southern Sama | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| apc | North Levantine Arabic | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| eu | Basque | 21 | 914 | 53 | 10 | 2 | 1 | 0 | 0 | 0 | 0 | 1001 |
| udm | Udmurt | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| yua | Yucatec Maya | 0 | 124 | 16 | 9 | 6 | 1 | 0 | 0 | 0 | 0 | 156 |
| agj | Argobba | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| mwr | Marwari | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| kj | Ovambo | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| vmb | Mbabaram | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| lg | Luganda | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| cui | Cuiba | 0 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| zai | Isthmus Zapotec | 1 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| gcf | Antillean Creole | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| sd | Sindhi | 0 | 31 | 7 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 43 |
| yux | Southern Yukaghir | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| yag | Yaghan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| scn | Sicilian | 45 | 820 | 145 | 41 | 10 | 2 | 0 | 0 | 1 | 1 | 1065 |
| ain | Ainu | 0 | 63 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 68 |
| wbp | Warlpiri | 0 | 45 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 47 |
| lvk | Lavukaleve | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| aii | Assyrian Neo-Aramaic | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mdf | Moksha | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
| owl | Old Welsh | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| be | Belarusian | 0 | 298 | 15 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 314 |
| kjh | Khakas | 0 | 133 | 20 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 156 |
| myh | Makah | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| hy | Armenian | 63 | 6220 | 1321 | 284 | 92 | 54 | 23 | 6 | 7 | 3 | 8073 |
| alq | Algonquin | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| pwn | Paiwan | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| kln | Kalenjin | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| tgk | Tajik | 0 | 389 | 78 | 39 | 17 | 9 | 2 | 3 | 1 | 0 | 538 |
| ale | Aleut | 0 | 379 | 32 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 417 |
| bdy | Bandjalang | 0 | 34 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
| yrk | Nenets | 0 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
| sma | Southern Sami | 1 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| cjm | Eastern Cham | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| niv | Nivkh | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| lzz | Laz | 0 | 27 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
| dif | Dieri | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| ems | Alutiiq | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| yan | Mayangna | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| nxn | Ngawun | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| xta | Alcozauca Mixtec | 0 | 24 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
| udi | Udi | 0 | 33 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
| zko | Kott | 1 | 121 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 130 |
| see | Seneca | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| cmn | Mandarin | 753 | 53151 | 2930 | 593 | 222 | 134 | 128 | 91 | 89 | 90 | 58181 |
| hrx | Hunsrik | 0 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
| nah | Nahuatl | 25 | 648 | 92 | 20 | 5 | 2 | 1 | 0 | 0 | 0 | 793 |
| sms | Skolt Sami | 1 | 86 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 88 |
| tyv | Tuvan | 0 | 36 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
| wyb | Ngiyambaa | 0 | 36 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
| csb | Cassubian | 0 | 399 | 39 | 17 | 3 | 1 | 0 | 0 | 0 | 0 | 459 |
| am | Amharic | 0 | 152 | 5 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 158 |
| pdc | Pennsylvania German | 1 | 15 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 17 |
| pox | Polabian | 0 | 151 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 158 |
| ka | Georgian | 4 | 1272 | 133 | 12 | 3 | 1 | 0 | 0 | 0 | 0 | 1425 |
| dgr | Dogrib | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| xas | Kamassian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| chh | Chinook | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ang | Old English | 130 | 2509 | 670 | 252 | 83 | 27 | 3 | 2 | 0 | 1 | 3677 |
| loz | Lozi | 0 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| fa | Persian | 23 | 3497 | 832 | 382 | 170 | 55 | 31 | 19 | 13 | 5 | 5027 |
| rut | Rutul | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| aqc | Archi | 0 | 14 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| zen | Zenaga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| knw | !Kung | 0 | 73 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 73 |
| gv | Manx | 137 | 3591 | 602 | 185 | 106 | 42 | 24 | 8 | 4 | 5 | 4704 |
| knb | Lubuagan Kalinga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| hr | Croatian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| ish | Esan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| rcf | Réunion Creole | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| az | Azerbaijani | 6 | 752 | 62 | 8 | 6 | 2 | 0 | 0 | 0 | 0 | 836 |
| luo | Dholuo | 0 | 32 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
| hwc | Hawaiian Pidgin | 0 | 13 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
| chr | Cherokee | 0 | 282 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 287 |
| mth | Munggui | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| tsn | Tswana | 1 | 74 | 15 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 93 |
| roa-ptg | Galician-Portuguese | 0 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
| pit | Pitta-Pitta | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| mk | Macedonian | 4 | 711 | 120 | 32 | 13 | 3 | 2 | 0 | 0 | 0 | 885 |
| jam | Jamaican Creole | 0 | 29 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 31 |
| enm | Middle English | 28 | 813 | 47 | 12 | 4 | 4 | 0 | 0 | 0 | 0 | 908 |
| aus-gun | Gunai | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| xpg | Phrygian | 0 | 16 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 18 |
| sva | Svan | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| aoz | Uab Meto | 0 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| ay | Aymara | 0 | 28 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
| vls | Flemish | 0 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| brh | Brahui | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| kxv | Kuvi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| akl | Aklanon | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
| rhg | Rohingya | 0 | 188 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 189 |
| ceb | Cebuano | 0 | 41 | 9 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 51 |
| ase | American Sign Language | 0 | 299 | 47 | 11 | 3 | 1 | 0 | 0 | 0 | 0 | 361 |
| wo | Wolof | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
| aus-wem | Wemba-Wemba | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| arz | Egyptian Arabic | 2 | 269 | 10 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 284 |
| su | Sundanese | 0 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
| dgi | Northern Dagara | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wim | Wik-Mungknh | 0 | 24 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
| inh | Ingush | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
| he | Hebrew | 122 | 4373 | 571 | 168 | 49 | 19 | 10 | 4 | 4 | 0 | 5320 |
| fr | French | 3811 | 26712 | 4501 | 1205 | 415 | 204 | 83 | 40 | 17 | 11 | 36999 |
| amu | Amuzgo | 0 | 26 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
| kbd | Kabardian | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| fro | Old French | 321 | 1737 | 237 | 42 | 13 | 1 | 0 | 0 | 0 | 0 | 2351 |
| aus-wwg | Woiwurrung | 0 | 41 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
| sk | Slovak | 8 | 983 | 110 | 24 | 11 | 2 | 1 | 0 | 0 | 0 | 1139 |
| ta | Tamil | 0 | 383 | 24 | 17 | 3 | 1 | 2 | 0 | 2 | 0 | 432 |
| dtp | Dusun | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| bvb | Bube | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| vep | Veps | 0 | 133 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 139 |
| win | Winnebago | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| mga | Middle Irish | 0 | 18 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 21 |
| ada | Adangme | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| jbo | Lojban | 66 | 1521 | 15 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1604 |
| coo | Comox | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
| om | Oromo | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
| mia | Miami-Illinois | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| xum | Umbrian | 0 | 19 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
| xrn | Arin | 0 | 31 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| kyi | Kiput | 0 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
| asm | Assamese | 0 | 20 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
| xlu | Luwian | 0 | 46 | 4 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 54 |
| lt | Lithuanian | 75 | 17248 | 2888 | 479 | 82 | 7 | 4 | 1 | 2 | 1 | 20787 |
| lua | Luba-Kasai | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| rop | Kriol | 0 | 27 | 4 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
| bua | Buryat | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| pap | Papiamento | 0 | 122 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 127 |
| xcr | Carian | 0 | 49 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50 |
| yur | Yurok | 0 | 259 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 264 |
| ast | Asturian | 38 | 452 | 23 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 515 |
| ltc | Middle Chinese | 23 | 497 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 520 |
| xbc | Bactrian | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| mic | Mi'kmaq | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| mt | Maltese | 8 | 952 | 48 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 1016 |
| lun | Lunda | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| pis | Pijin | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| zh | Chinese | 0 | 13 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| wbb | Wabo | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| crp-rsn | Russenorsk | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| bi | Bislama | 0 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
| amf | Hamer-Banna | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| non | Old Norse | 2 | 773 | 108 | 16 | 5 | 1 | 3 | 1 | 0 | 0 | 909 |
| kbc | Kadiwéu | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| kld | Gamilaraay | 2 | 122 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 140 |
| sqi | Albanian | 15 | 1562 | 170 | 36 | 5 | 3 | 3 | 0 | 0 | 0 | 1794 |
| acv | Achumawi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| wuu | Wu | 13 | 19 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 34 |
| sw | Swahili | 2 | 1969 | 57 | 11 | 1 | 0 | 1 | 1 | 0 | 0 | 2042 |
| ewe | Ewe | 0 | 484 | 59 | 16 | 9 | 1 | 1 | 0 | 0 | 0 | 570 |
| alr | Alyutor | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| amn | Amanab | 0 | 54 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
| hak | Hakka | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| tkr | Tsakhur | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| vro | Võro | 0 | 95 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 101 |
| is | Icelandic | 95 | 7230 | 1079 | 364 | 116 | 56 | 13 | 7 | 1 | 1 | 8962 |
| kum | Kumyk | 0 | 104 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 109 |
| nij | Ngaju | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
Part of speech [edit]
Total (all entries) [edit]
Number of words and senses [edit]
Rows in the table: 58
| Unique Strings | Total Word-Sense Pairs | POS | Short name | Templates | Max Senses | Entry |
|---|---|---|---|---|---|---|
| 38 | 81 | predicative | predicative | 5 | громко 5, ясно 5, легко 4 | |
| 370 | 454 | article | article | 8 | the 8, të 5, le 4 | |
| 175 | 214 | root | root | 5 | կոճ- 5, ج م ل 3, ف ق ر 3 | |
| 323 | 424 | postposition | postposition | 6 | için 6, को 6, کو 6 | |
| 586 | 854 | particle | particle | 18 | אין 18, ni 6, е 6 | |
| 1544 | 1636 | proverb | proverb | 4 | les petits ruisseaux font les grandes rivières 4, 三十年河东,三十年河西 4, 三十年河東,三十年河西 4 | |
| 1 | 1 | combined-kana character | combined-kana character | 1 | ㌈ 1 | |
| 6080 | 7994 | pronoun | pronoun | 18 | sebi 18, себи 18, się 10 | |
| 1393 | 25022 | pinyin | pinyin | 239 | yì 239, yù 164, zī 142 | |
| 12824 | 13097 | kanji | kanji | 7 | 丸 7, 赤 6, 科 6 | |
| 69 | 73 | expression | expression | 2 | aan de hand van 2, 実は 2, huisarrest geven 2 | |
| 78282 | 92574 | proper noun | proper noun | 34 | Neustadt 34, Takashi 32, こうじ 32 | |
| 10 | 15 | pronominal_adverb | pronominal adverb | 3 | waaronder 3, ernaar 2, waarop 2 | |
| 2745 | 2962 | letter | letter | 5 | ú 5, r 5, ⠋ 5 | |
| 10 | 10 | circumfix | circumfix | 1 | a- -ing 1, em- -en 1, an--ana 1 | |
| 844 | 1163 | acronym | acronym, Acronym | 24 | CED 24, CET 19, KOS 17 | |
| 163 | 882 | jyutping syllable | jyutping syllable | 30 | jyu4 30, kei4 22, zi1 22 | |
| 32563 | 32783 | hanzi | hanzi | 11 | 重 11, 书 10, 托 7 | |
| 1 | 1 | hanja reading | hanja reading | 1 | 겸 1 | |
| 1514 | 27109 | pinyin syllable | pinyin syllable | 239 | yi4 239, yu4 164, zhi4 133 | |
| 308 | 397 | affix | affix | 9 | fǎ 9, yuán 8, zhōng 8 | |
| 5 | 11 | preverb | preverb | 4 | gichi- 4, maajii- 3, ziigwan 1 | |
| 5 | 5 | lujvo | lujvo | 1 | benmro 1, mi'afra 1, xelkla 1 | |
| 26120 | 35396 | han character | han character | 17 | 正 17, 方 8, 牙 8 | |
| 3188 | 3800 | symbol | symbol | 16 | ՛ 16, z 9, Unsupported titles/Vertical line 9 | |
| 2144 | 2734 | conjunction | conjunction | 10 | че 10, da 8, да 8 | |
| 144 | 231 | counter | counter | 13 | 日 13, か 8, -hai 4 | |
| 2 | 2 | prenoun | prenoun | 1 | moose 1, aandeg 1 | |
| 3106 | 5160 | preposition | preposition | 186 | of 186, ל־ 38, на 26 | |
| 50 | 51 | katakana character | katakana character | 2 | ッ 2, ヲ 1, ィ 1 | |
| 1292 | 2269 | syllable | syllable | 44 | 가 44, 개 37, 차 35 | |
| 1343 | 1352 | gismu | gismu | 4 | bongu 4, gismu 2, jinme 2 | |
| 1874 | 2550 | abbreviation | Abbreviation | 10 | MS 10, lv 10, np 10 | |
| 36 | 45 | adnominal | adnominal | 3 | その 3, 無人 3, この 2 | |
| 3738 | 5048 | suffix | suffix | 27 | -nne 27, -ón 12, -ni 9 | |
| 15 | 15 | correlative | correlative | 1 | alial 1, aliam 1, aliel 1 | |
| 4550 | 5419 | interjection | interjection | 11 | oh 11, а 7, hey 6 | |
| 8938 | 9004 | hanja | hanja | 8 | 望 8, 得 4, 藩 4 | |
| 29 | 138 | kanji reading | kanji reading | 20 | ち 20, こう 19, てい 11 | |
| 2379 | 2592 | idiom | idiom | 4 | ubiti dve muve jednim udarcem 4, 不上不下 4, biéchūjīzhù 3 | |
| 10865 | 14795 | participle | participle | 14 | iacens 14, iaciturus 14, productus 9 | |
| 36 | 38 | measure word | measure word | 2 | 条 2, shēng 2, -টি 1 | |
| 6 | 14 | gerund | gerund | 4 | laborandum 4, definiendum 3, sufflaminandum 2 | |
| 2 | 2 | noun stem | noun stem | 1 | xòy- 1, xə́- 1 | |
| 3456 | 5996 | initialism | Initialism | 43 | AAA 43, CCA 40, DCC 27 | |
| 59 | 62 | classifier | classifier | 2 | ฉบับ 2, ຄັນ 2, ក្បាល 2 | |
| 2666 | 3486 | prefix | prefix | 15 | iš- 15, μετά 9, meta- 9 | |
| 44 | 45 | infix | infix | 2 | -n- 2, -axo- 1, -ba- 1 | |
| 1 | 2 | noun class | noun class | 2 | m 2 | |
| 595 | 775 | determiner | determiner | 6 | some 6, acelorași 6, acestor 6 | |
| 166719 | 219264 | adjective | adjective, quasi-adjective, adjectival noun | 28 | noncuple 28, langsamsten 27, schnellsten 27 | |
| 167889 | 235622 | verb | verb, verb prefix, verb form | 104 | catch 104, touch 44, वहति 41 | |
| 553313 | 714155 | noun | noun | 60 | mark 60, kōshō 55, こうしょう 55 | |
| 118 | 168 | prepositional phrase | prepositional phrase | 6 | in time 6, in front 5, all over the place 4 | |
| 16 | 16 | interfix | interfix | 1 | -a- 1, -e- 1, -e- 1 | |
| 7474 | 7720 | numeral | ordinal numeral, numeral, ordinal number, cardinal numeral, number, cardinal number | 19 | शत 19, Ↄ 6, einer 4 | |
| 3922 | 4233 | phrase | phrase | 7 | rejse sig 7, 's e ur beatha 4, the fuck 4 | |
| 35866 | 42645 | adverb | adverb | 30 | peculiarly 30, ruthfully 14, एवम् 10 |
Polysemy information [edit]
Rows in the table: 58
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| predicative | 15 | 23 | 66 | 2,13 | 2,87 |
| article | 312 | 58 | 142 | 1,23 | 2,45 |
| root | 143 | 32 | 71 | 1,22 | 2,22 |
| postposition | 265 | 58 | 159 | 1,31 | 2,74 |
| particle | 451 | 135 | 403 | 1,46 | 2,99 |
| proverb | 1466 | 78 | 170 | 1,06 | 2,18 |
| combined-kana character | 1 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 4835 | 1245 | 3159 | 1,31 | 2,54 |
| pinyin | 168 | 1225 | 24854 | 17,96 | 20,29 |
| kanji | 12676 | 148 | 421 | 1,02 | 2,84 |
| expression | 65 | 4 | 8 | 1,06 | 2,0 |
| proper noun | 69197 | 9085 | 23377 | 1,18 | 2,57 |
| pronominal_adverb | 6 | 4 | 9 | 1,5 | 2,25 |
| letter | 2558 | 187 | 404 | 1,08 | 2,16 |
| circumfix | 10 | 0 | 0 | 1,0 | -1,0 |
| acronym | 706 | 138 | 457 | 1,38 | 3,31 |
| jyutping syllable | 20 | 143 | 862 | 5,41 | 6,03 |
| hanzi | 32432 | 131 | 351 | 1,01 | 2,68 |
| hanja reading | 1 | 0 | 0 | 1,0 | -1,0 |
| pinyin syllable | 209 | 1305 | 26900 | 17,91 | 20,61 |
| affix | 267 | 41 | 130 | 1,29 | 3,17 |
| preverb | 2 | 3 | 9 | 2,2 | 3,0 |
| lujvo | 5 | 0 | 0 | 1,0 | -1,0 |
| han character | 19438 | 6682 | 15958 | 1,36 | 2,39 |
| symbol | 2825 | 363 | 975 | 1,19 | 2,69 |
| conjunction | 1754 | 390 | 980 | 1,28 | 2,51 |
| counter | 102 | 42 | 129 | 1,6 | 3,07 |
| prenoun | 2 | 0 | 0 | 1,0 | -1,0 |
| preposition | 2308 | 798 | 2852 | 1,66 | 3,57 |
| katakana character | 49 | 1 | 2 | 1,02 | 2,0 |
| syllable | 1204 | 88 | 1065 | 1,76 | 12,1 |
| gismu | 1336 | 7 | 16 | 1,01 | 2,29 |
| abbreviation | 1519 | 355 | 1031 | 1,36 | 2,9 |
| adnominal | 29 | 7 | 16 | 1,25 | 2,29 |
| suffix | 2991 | 747 | 2057 | 1,35 | 2,75 |
| correlative | 15 | 0 | 0 | 1,0 | -1,0 |
| interjection | 3933 | 617 | 1486 | 1,19 | 2,41 |
| hanja | 8899 | 39 | 105 | 1,01 | 2,69 |
| kanji reading | 6 | 23 | 132 | 4,76 | 5,74 |
| idiom | 2198 | 181 | 394 | 1,09 | 2,18 |
| participle | 8186 | 2679 | 6609 | 1,36 | 2,47 |
| measure word | 34 | 2 | 4 | 1,06 | 2,0 |
| gerund | 2 | 4 | 12 | 2,33 | 3,0 |
| noun stem | 2 | 0 | 0 | 1,0 | -1,0 |
| initialism | 2365 | 1091 | 3631 | 1,73 | 3,33 |
| classifier | 56 | 3 | 6 | 1,05 | 2,0 |
| prefix | 2158 | 508 | 1328 | 1,31 | 2,61 |
| infix | 43 | 1 | 2 | 1,02 | 2,0 |
| noun class | 0 | 1 | 2 | 2,0 | 2,0 |
| determiner | 484 | 111 | 291 | 1,3 | 2,62 |
| adjective | 134029 | 32690 | 85235 | 1,32 | 2,61 |
| verb | 129731 | 38158 | 105891 | 1,4 | 2,78 |
| noun | 456777 | 96536 | 257378 | 1,29 | 2,67 |
| prepositional phrase | 83 | 35 | 85 | 1,42 | 2,43 |
| interfix | 16 | 0 | 0 | 1,0 | -1,0 |
| numeral | 7292 | 182 | 428 | 1,03 | 2,35 |
| phrase | 3669 | 253 | 564 | 1,08 | 2,23 |
| adverb | 30886 | 4980 | 11759 | 1,19 | 2,36 |
English entries [edit]
Number of words with unknown POS: 768
Number of words and senses [edit]
Rows in the table: 29
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 11 | 21 | article | 8 | the 8, a 4, da 1 |
| 347 | 813 | preposition | 186 | of 186, on 20, over 13 |
| 1 | 1 | postposition | 1 | non obst. 1 |
| 1856 | 2532 | abbreviation | 10 | MS 10, lv 10, np 10 |
| 599 | 854 | suffix | 11 | -ist 11, -es 8, -es 7 |
| 1420 | 1685 | interjection | 11 | oh 11, hey 6, eek 6 |
| 8 | 13 | particle | 4 | like 4, to 2, 2 1 |
| 8 | 9 | idiom | 2 | cheap at half the price 2, as fit as a butcher's dog 1, break one's arm patting oneself on the back 1 |
| 404 | 435 | proverb | 3 | there but for the grace of God go I 3, God works in mysterious ways 2, what you see is what you get 2 |
| 297 | 391 | pronoun | 9 | me 9, she 4, myself 4 |
| 15697 | 20581 | proper noun | 20 | Shemaiah 20, Sunda 12, Lincoln 10 |
| 3456 | 5996 | initialism | 43 | AAA 43, CCA 40, DCC 27 |
| 55 | 56 | letter | 2 | k 2, a 1, x 1 |
| 3 | 3 | circumfix | 1 | a- -ing 1, em- -en 1, en- -en 1 |
| 777 | 1092 | acronym | 24 | CED 24, CET 19, KOS 17 |
| 867 | 1140 | prefix | 9 | be- 9, meta- 9, super- 5 |
| 35 | 35 | infix | 1 | -a- 1, -axo- 1, -ba- 1 |
| 11 | 11 | affix | 1 | -fung- 1, -i- 1, -kin- 1 |
| 70 | 110 | determiner | 6 | some 6, which 5, all 4 |
| 57525 | 72320 | adjective | 28 | noncuple 28, cross-channel 25, unlachrymose 25 |
| 37002 | 53777 | verb | 104 | catch 104, touch 44, work 33 |
| 143062 | 192819 | noun | 60 | mark 60, line 52, ward 31 |
| 110 | 160 | prepositional phrase | 6 | in time 6, in front 5, all over the place 4 |
| 1025 | 1144 | phrase | 5 | excuse me 5, the fuck 4, what's up 4 |
| 11259 | 13055 | adverb | 30 | peculiarly 30, ruthfully 14, away 7 |
| 52 | 73 | symbol | 5 | A 5, AAA 4, b 4 |
| 342 | 397 | numeral | 3 | billion 3, novemdecillion 3, eighteen hundred 3 |
| 4 | 4 | interfix | 1 | -i- 1, -k- 1, -n- 1 |
| 167 | 251 | conjunction | 8 | and 8, as 7, if 6 |
Polysemy information [edit]
Rows in the table: 29
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| article | 9 | 2 | 12 | 1,91 | 6,0 |
| preposition | 252 | 95 | 561 | 2,34 | 5,91 |
| postposition | 1 | 0 | 0 | 1,0 | -1,0 |
| abbreviation | 1501 | 355 | 1031 | 1,36 | 2,9 |
| suffix | 439 | 160 | 415 | 1,43 | 2,59 |
| interjection | 1244 | 176 | 441 | 1,19 | 2,51 |
| particle | 5 | 3 | 8 | 1,62 | 2,67 |
| idiom | 7 | 1 | 2 | 1,12 | 2,0 |
| proverb | 374 | 30 | 61 | 1,08 | 2,03 |
| pronoun | 241 | 56 | 150 | 1,32 | 2,68 |
| proper noun | 12217 | 3480 | 8364 | 1,31 | 2,4 |
| initialism | 2365 | 1091 | 3631 | 1,73 | 3,33 |
| letter | 54 | 1 | 2 | 1,02 | 2,0 |
| circumfix | 3 | 0 | 0 | 1,0 | -1,0 |
| acronym | 643 | 134 | 449 | 1,41 | 3,35 |
| prefix | 688 | 179 | 452 | 1,31 | 2,53 |
| infix | 35 | 0 | 0 | 1,0 | -1,0 |
| affix | 11 | 0 | 0 | 1,0 | -1,0 |
| determiner | 49 | 21 | 61 | 1,57 | 2,9 |
| adjective | 47907 | 9618 | 24413 | 1,26 | 2,54 |
| verb | 28932 | 8070 | 24845 | 1,45 | 3,08 |
| noun | 115772 | 27290 | 77047 | 1,35 | 2,82 |
| prepositional phrase | 75 | 35 | 85 | 1,45 | 2,43 |
| phrase | 928 | 97 | 216 | 1,12 | 2,23 |
| adverb | 9931 | 1328 | 3124 | 1,16 | 2,35 |
| symbol | 41 | 11 | 32 | 1,4 | 2,91 |
| numeral | 299 | 43 | 98 | 1,16 | 2,28 |
| interfix | 4 | 0 | 0 | 1,0 | -1,0 |
| conjunction | 121 | 46 | 130 | 1,5 | 2,83 |
Russian entries [edit]
Number of words with unknown POS: 989
Number of words and senses [edit]
Rows in the table: 24
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 38 | 81 | predicative | 5 | громко 5, ясно 5, легко 4 |
| 1366 | 1514 | proper noun | 6 | Русь 6, Тура 6, Тигр 4 |
| 46 | 53 | letter | 3 | ь 3, у 2, а 2 |
| 78 | 141 | preposition | 11 | по 11, на 7, за 5 |
| 20 | 20 | acronym | 1 | СПИД 1, Би-би-си 1, ЦРУ 1 |
| 19 | 27 | prefix | 4 | за- 4, без- 2, пра- 2 |
| 1 | 103 | pinyin syllable | 103 | shì 103 |
| 1 | 1 | affix | 1 | -ун- 1 |
| 7 | 11 | determiner | 3 | всякий 3, иной 2, двое 1 |
| 23 | 26 | suffix | 2 | -ность 2, -ов 2, -то 2 |
| 1895 | 2768 | adjective | 8 | злой 8, твёрдый 6, тяжёлый 6 |
| 167 | 219 | interjection | 5 | ведь 5, ёб твою мать 4, хуй тебе 4 |
| 1508 | 2996 | verb | 10 | разводить 10, развести 10, проводить 10 |
| 9232 | 13197 | noun | 13 | ход 13, рожок 11, приём 11 |
| 31 | 44 | particle | 5 | не 5, ни 4, хули 2 |
| 50 | 53 | proverb | 3 | что посеешь, то и пожнёшь 3, куй железо, пока горячо 2, за двумя зайцами погонишься, ни одного не поймаешь 1 |
| 42 | 44 | idiom | 2 | лёгок на помине 2, на славу 2, ездить в Тулу со своим самоваром 1 |
| 146 | 206 | pronoun | 4 | этот 4, ничего 3, та 3 |
| 117 | 122 | phrase | 2 | в чём дело 2, что за чёрт 2, какого чёрта 2 |
| 636 | 894 | adverb | 7 | так 7, темно 6, ясно 5 |
| 2 | 2 | symbol | 1 | × 1, @ 1 |
| 72 | 79 | numeral | 7 | один 7, второй 2, биллион 1 |
| 62 | 107 | participle | 6 | отходя 6, используемой 4, используемого 3 |
| 51 | 67 | conjunction | 3 | как 3, или 3, чтобы 3 |
Polysemy information [edit]
Rows in the table: 24
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| predicative | 15 | 23 | 66 | 2,13 | 2,87 |
| proper noun | 1243 | 123 | 271 | 1,11 | 2,2 |
| letter | 40 | 6 | 13 | 1,15 | 2,17 |
| preposition | 49 | 29 | 92 | 1,81 | 3,17 |
| acronym | 20 | 0 | 0 | 1,0 | -1,0 |
| prefix | 13 | 6 | 14 | 1,42 | 2,33 |
| pinyin syllable | 0 | 1 | 103 | 1,03e+02 | 1,03e+02 |
| affix | 1 | 0 | 0 | 1,0 | -1,0 |
| determiner | 4 | 3 | 7 | 1,57 | 2,33 |
| suffix | 20 | 3 | 6 | 1,13 | 2,0 |
| adjective | 1381 | 514 | 1387 | 1,46 | 2,7 |
| interjection | 129 | 38 | 90 | 1,31 | 2,37 |
| verb | 746 | 762 | 2250 | 1,99 | 2,95 |
| noun | 6851 | 2381 | 6346 | 1,43 | 2,67 |
| particle | 23 | 8 | 21 | 1,42 | 2,62 |
| proverb | 48 | 2 | 5 | 1,06 | 2,5 |
| idiom | 40 | 2 | 4 | 1,05 | 2,0 |
| pronoun | 99 | 47 | 107 | 1,41 | 2,28 |
| phrase | 112 | 5 | 10 | 1,04 | 2,0 |
| adverb | 459 | 177 | 435 | 1,41 | 2,46 |
| symbol | 2 | 0 | 0 | 1,0 | -1,0 |
| numeral | 70 | 2 | 9 | 1,1 | 4,5 |
| participle | 35 | 27 | 72 | 1,73 | 2,67 |
| conjunction | 39 | 12 | 28 | 1,31 | 2,33 |
Finnish entries [edit]
Number of words with unknown POS: 145
Number of words and senses [edit]
Rows in the table: 20
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 2055 | 2400 | proper noun | 4 | Jumalan Karitsa 4, Aura 4, Soini 4 |
| 3 | 3 | letter | 1 | ä 1, ö 1, å 1 |
| 14 | 16 | preposition | 2 | kiinni 2, kautta 2, a propos 1 |
| 3 | 3 | acronym | 1 | SKY 1, STT 1, jtkn 1 |
| 138 | 153 | prefix | 3 | avo- 3, vasta- 3, vara- 2 |
| 106 | 140 | postposition | 4 | vastaan 4, asti 4, saakka 4 |
| 142 | 271 | suffix | 27 | -nne 27, -nsä 9, -ni 9 |
| 5793 | 6869 | adjective | 15 | sopiva 15, heikko 11, tavallinen 8 |
| 224 | 248 | interjection | 4 | älä 4, jaa 3, ai 2 |
| 10464 | 12963 | verb | 17 | avata 17, laskea 16, purkaa 16 |
| 34267 | 39412 | noun | 13 | kuori 13, juttu 12, varsi 11 |
| 12 | 43 | particle | 6 | -hän 6, -kaan 5, -pä 4 |
| 49 | 52 | idiom | 2 | helppoa kuin heinänteko 2, olla olevinaan 2, helppoa kuin mikä 2 |
| 40 | 43 | proverb | 2 | parempi katsoa kuin katua 2, yhteistyö on voimaa 2, rohkea rokan syö, kaino ei saa kaaliakaan 2 |
| 138 | 208 | pronoun | 5 | muu 5, kaikki 4, joku 4 |
| 3 | 3 | symbol | 1 | Gt 1, Mt 1, kt 1 |
| 152 | 161 | phrase | 2 | ole hyvä 2, olkaa hyvä 2, helpommin sanottu kuin tehty 2 |
| 189 | 193 | numeral | 3 | seiska 3, yhdes 2, sata 1 |
| 2139 | 2441 | adverb | 8 | hajalla 8, hajalle 8, niin 6 |
| 56 | 68 | conjunction | 3 | että 3, kun 3, niin kuin 3 |
Polysemy information [edit]
Rows in the table: 20
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| proper noun | 1736 | 319 | 664 | 1,17 | 2,08 |
| letter | 3 | 0 | 0 | 1,0 | -1,0 |
| preposition | 12 | 2 | 4 | 1,14 | 2,0 |
| acronym | 3 | 0 | 0 | 1,0 | -1,0 |
| prefix | 125 | 13 | 28 | 1,11 | 2,15 |
| postposition | 86 | 20 | 54 | 1,32 | 2,7 |
| suffix | 109 | 33 | 162 | 1,91 | 4,91 |
| adjective | 5052 | 741 | 1817 | 1,19 | 2,45 |
| interjection | 204 | 20 | 44 | 1,11 | 2,2 |
| verb | 8954 | 1510 | 4009 | 1,24 | 2,65 |
| noun | 30646 | 3621 | 8766 | 1,15 | 2,42 |
| particle | 1 | 11 | 42 | 3,58 | 3,82 |
| idiom | 46 | 3 | 6 | 1,06 | 2,0 |
| proverb | 37 | 3 | 6 | 1,08 | 2,0 |
| pronoun | 87 | 51 | 121 | 1,51 | 2,37 |
| symbol | 3 | 0 | 0 | 1,0 | -1,0 |
| phrase | 143 | 9 | 18 | 1,06 | 2,0 |
| numeral | 186 | 3 | 7 | 1,02 | 2,33 |
| adverb | 1914 | 225 | 527 | 1,14 | 2,34 |
| conjunction | 47 | 9 | 21 | 1,21 | 2,33 |
Ukrainian entries [edit]
Number of words with unknown POS: 1
Number of words and senses [edit]
Rows in the table: 15
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 142 | 147 | proper noun | 3 | Русь 3, Ірландія 2, Галич 2 |
| 7 | 7 | letter | 1 | є 1, Є 1, І 1 |
| 1 | 1 | preposition | 1 | без 1 |
| 1 | 1 | acronym | 1 | ЗМІ 1 |
| 26 | 27 | adjective | 2 | англійський 2, канадійський 1, лютий 1 |
| 4 | 4 | interjection | 1 | якби 1, і 1, до побачення 1 |
| 22 | 32 | verb | 3 | їсти 3, є 3, слухати 2 |
| 448 | 544 | noun | 6 | дід 6, земля 5, слово 5 |
| 2 | 3 | particle | 2 | чи 2 |
| 32 | 40 | pronoun | 2 | її 2, що 2, хто 2 |
| 1 | 2 | symbol | 2 | ’ 2 |
| 6 | 6 | phrase | 1 | я не знаю 1, у мене є питання 1, ви говорите англійською 1 |
| 9 | 9 | numeral | 1 | один 1, вісім 1, п'ять 1 |
| 12 | 26 | adverb | 11 | навмисно 11, дальше 4, тепер 2 |
| 8 | 8 | conjunction | 1 | і 1, але 1, якщо 1 |
Polysemy information [edit]
Rows in the table: 15
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| proper noun | 138 | 4 | 9 | 1,04 | 2,25 |
| letter | 7 | 0 | 0 | 1,0 | -1,0 |
| preposition | 1 | 0 | 0 | 1,0 | -1,0 |
| acronym | 1 | 0 | 0 | 1,0 | -1,0 |
| adjective | 25 | 1 | 2 | 1,04 | 2,0 |
| interjection | 4 | 0 | 0 | 1,0 | -1,0 |
| verb | 14 | 8 | 18 | 1,45 | 2,25 |
| noun | 387 | 61 | 157 | 1,21 | 2,57 |
| particle | 1 | 1 | 2 | 1,5 | 2,0 |
| pronoun | 24 | 8 | 16 | 1,25 | 2,0 |
| symbol | 0 | 1 | 2 | 2,0 | 2,0 |
| phrase | 6 | 0 | 0 | 1,0 | -1,0 |
| numeral | 9 | 0 | 0 | 1,0 | -1,0 |
| adverb | 9 | 3 | 17 | 2,17 | 5,67 |
| conjunction | 8 | 0 | 0 | 1,0 | -1,0 |
French entries [edit]
Number of words with unknown POS: 350
Number of words and senses [edit]
Rows in the table: 26
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 10 | 16 | article | 4 | le 4, de la 2, les 2 |
| 89 | 163 | preposition | 14 | à 14, sur 5, dans 4 |
| 145 | 180 | suffix | 6 | -is 6, -ant 3, -aud 3 |
| 186 | 206 | interjection | 3 | tiens 3, ouais 3, mince 2 |
| 6 | 9 | particle | 3 | ne 3, est-ce que 2, ô 1 |
| 48 | 54 | proverb | 4 | les petits ruisseaux font les grandes rivières 4, l'habit ne fait pas le moine 2, les chiens aboient, la caravane passe 2 |
| 5 | 6 | idiom | 2 | et ainsi de suite 2, avoir l'estomac dans les talons 1, poser un lapin 1 |
| 125 | 185 | pronoun | 7 | se 7, vous 7, y 3 |
| 1 | 2 | participle | 2 | chauvissant 2 |
| 1 | 2 | expression | 2 | être sur le cul 2 |
| 1834 | 2197 | proper noun | 4 | Élie 4, Élisée 4, Élisabeth 4 |
| 22 | 23 | letter | 2 | Ç 2, x 1, o 1 |
| 9 | 9 | acronym | 1 | APN 1, Assedic 1, DALO 1 |
| 109 | 115 | prefix | 2 | beau- 2, dys- 2, ferro- 2 |
| 1 | 1 | infix | 1 | -iss- 1 |
| 1 | 1 | affix | 1 | -un- 1 |
| 3 | 3 | determiner | 1 | ce 1, ledit 1, cet 1 |
| 5410 | 6382 | adjective | 8 | chaleureux 8, fini 7, aigu 6 |
| 5756 | 8988 | verb | 31 | passer 31, claquer 12, relever 12 |
| 17402 | 22443 | noun | 18 | tampon 18, verre 14, papillote 11 |
| 1 | 1 | prepositional phrase | 1 | en tête 1 |
| 8 | 10 | symbol | 2 | « 2, » 2, Mo 1 |
| 1546 | 1785 | adverb | 7 | plus 7, singulièrement 6, carrément 5 |
| 39 | 39 | numeral | 1 | billion 1, trillion 1, million 1 |
| 179 | 204 | phrase | 3 | allons-y 3, avant la lettre 2, aux calendes grecques 2 |
| 63 | 74 | conjunction | 3 | comme 3, sinon 3, alors 2 |
Polysemy information [edit]
Rows in the table: 26
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| article | 6 | 4 | 10 | 1,6 | 2,5 |
| preposition | 61 | 28 | 102 | 1,83 | 3,64 |
| suffix | 122 | 23 | 58 | 1,24 | 2,52 |
| interjection | 168 | 18 | 38 | 1,11 | 2,11 |
| particle | 4 | 2 | 5 | 1,5 | 2,5 |
| proverb | 44 | 4 | 10 | 1,12 | 2,5 |
| idiom | 4 | 1 | 2 | 1,2 | 2,0 |
| pronoun | 78 | 47 | 107 | 1,48 | 2,28 |
| participle | 0 | 1 | 2 | 2,0 | 2,0 |
| expression | 0 | 1 | 2 | 2,0 | 2,0 |
| proper noun | 1521 | 313 | 676 | 1,2 | 2,16 |
| letter | 21 | 1 | 2 | 1,05 | 2,0 |
| acronym | 9 | 0 | 0 | 1,0 | -1,0 |
| prefix | 103 | 6 | 12 | 1,06 | 2,0 |
| infix | 1 | 0 | 0 | 1,0 | -1,0 |
| affix | 1 | 0 | 0 | 1,0 | -1,0 |
| determiner | 3 | 0 | 0 | 1,0 | -1,0 |
| adjective | 4676 | 734 | 1706 | 1,18 | 2,32 |
| verb | 3924 | 1832 | 5064 | 1,56 | 2,76 |
| noun | 14141 | 3261 | 8302 | 1,29 | 2,55 |
| prepositional phrase | 1 | 0 | 0 | 1,0 | -1,0 |
| symbol | 6 | 2 | 4 | 1,25 | 2,0 |
| adverb | 1367 | 179 | 418 | 1,15 | 2,34 |
| numeral | 39 | 0 | 0 | 1,0 | -1,0 |
| phrase | 155 | 24 | 49 | 1,14 | 2,04 |
| conjunction | 54 | 9 | 20 | 1,17 | 2,22 |
German entries [edit]
Number of words with unknown POS: 242
Number of words and senses [edit]
Rows in the table: 24
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 25 | 31 | article | 2 | den 2, das 2, dem 2 |
| 2142 | 2408 | proper noun | 34 | Neustadt 34, Bosch 4, Bergen 3 |
| 26 | 26 | letter | 1 | I 1, C 1, X 1 |
| 1 | 1 | circumfix | 1 | ge- -t 1 |
| 79 | 124 | preposition | 7 | an 7, bei 7, nach 5 |
| 5 | 7 | acronym | 2 | BND 2, ÖPNV 2, LDVH 1 |
| 72 | 94 | prefix | 4 | ab- 4, um- 4, nach- 4 |
| 1 | 2 | determiner | 2 | welche 2 |
| 3410 | 5523 | adjective | 27 | hässlichsten 27, langsamsten 27, schnellsten 27 |
| 48 | 61 | suffix | 5 | -e 5, -en 4, -ig 3 |
| 101 | 116 | interjection | 5 | ach 5, bitte 3, servus 3 |
| 3114 | 4440 | verb | 14 | abgeben 14, lösen 9, schneiden 8 |
| 3 | 4 | particle | 2 | oder 2, zu 1 |
| 15754 | 18583 | noun | 15 | c. 15, Zauberer 11, Zauberin 11 |
| 37 | 40 | idiom | 2 | mitgehen lassen 2, geschehen ist geschehen 2, Äpfel mit Birnen vergleichen 2 |
| 27 | 27 | proverb | 1 | Arbeit macht frei 1, Blut ist dicker als Wasser 1, Geduld ist eine Tugend 1 |
| 101 | 152 | pronoun | 5 | deiner 5, meiner 5, alle 4 |
| 6 | 6 | symbol | 1 | ß 1, J 1, ü 1 |
| 93 | 100 | phrase | 3 | Holzweg 3, grüß Gott 2, gern geschehen 2 |
| 144 | 150 | numeral | 4 | einer 4, einem 2, einen 2 |
| 1 | 1 | interfix | 1 | -s- 1 |
| 626 | 762 | adverb | 5 | dabei 5, noch 4, ja 3 |
| 2 | 4 | participle | 3 | verzaubert 3 |
| 52 | 64 | conjunction | 4 | als 4, infolgedessen 2, soweit 2 |
Polysemy information [edit]
Rows in the table: 24
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| article | 19 | 6 | 12 | 1,24 | 2,0 |
| proper noun | 1926 | 216 | 482 | 1,12 | 2,23 |
| letter | 26 | 0 | 0 | 1,0 | -1,0 |
| circumfix | 1 | 0 | 0 | 1,0 | -1,0 |
| preposition | 59 | 20 | 65 | 1,57 | 3,25 |
| acronym | 3 | 2 | 4 | 1,4 | 2,0 |
| prefix | 59 | 13 | 35 | 1,31 | 2,69 |
| determiner | 0 | 1 | 2 | 2,0 | 2,0 |
| adjective | 2917 | 493 | 2606 | 1,62 | 5,29 |
| suffix | 42 | 6 | 19 | 1,27 | 3,17 |
| interjection | 91 | 10 | 25 | 1,15 | 2,5 |
| verb | 2318 | 796 | 2122 | 1,43 | 2,67 |
| particle | 2 | 1 | 2 | 1,33 | 2,0 |
| noun | 13763 | 1991 | 4820 | 1,18 | 2,42 |
| idiom | 34 | 3 | 6 | 1,08 | 2,0 |
| proverb | 27 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 67 | 34 | 85 | 1,5 | 2,5 |
| symbol | 6 | 0 | 0 | 1,0 | -1,0 |
| phrase | 87 | 6 | 13 | 1,08 | 2,17 |
| numeral | 140 | 4 | 10 | 1,04 | 2,5 |
| interfix | 1 | 0 | 0 | 1,0 | -1,0 |
| adverb | 525 | 101 | 237 | 1,22 | 2,35 |
| participle | 1 | 1 | 3 | 2,0 | 3,0 |
| conjunction | 42 | 10 | 22 | 1,23 | 2,2 |
Serbian entries [edit]
Number of words with unknown POS: null
Number of words and senses [edit]
Rows in the table: 3
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 1 | 1 | proper noun | 1 | Dušan 1 |
| 24 | 24 | letter | 1 | в 1, з 1, н 1 |
| 1 | 1 | adverb | 1 | џ 1 |
Polysemy information [edit]
Rows in the table: 3
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| proper noun | 1 | 0 | 0 | 1,0 | -1,0 |
| letter | 24 | 0 | 0 | 1,0 | -1,0 |
| adverb | 1 | 0 | 0 | 1,0 | -1,0 |
Tatar entries [edit]
Number of words with unknown POS: null
Number of words and senses [edit]
Rows in the table: 11
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 50 | 53 | proper noun | 2 | Paris 2, Éstonia 2, Neptun 2 |
| 1 | 1 | letter | 1 | ү 1 |
| 50 | 50 | adjective | 1 | әрмәни 1, hart 1, вәхши 1 |
| 2 | 3 | suffix | 2 | -ле 2, -лы 1 |
| 1 | 1 | interjection | 1 | әйе 1 |
| 5 | 5 | verb | 1 | абын 1, сабир 1, сабыр 1 |
| 372 | 400 | noun | 3 | çirek 3, дәвер 3, заман 3 |
| 1 | 1 | pronoun | 1 | sin 1 |
| 15 | 17 | adverb | 2 | зерә 2, әрмәнчә 2, töpede 1 |
| 72 | 72 | numeral | 1 | trillion 1, өч йөз 1, öç yöz 1 |
| 3 | 3 | conjunction | 1 | belän 1, белән 1, билан 1 |
Polysemy information [edit]
Rows in the table: 11
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| proper noun | 47 | 3 | 6 | 1,06 | 2,0 |
| letter | 1 | 0 | 0 | 1,0 | -1,0 |
| adjective | 50 | 0 | 0 | 1,0 | -1,0 |
| suffix | 1 | 1 | 2 | 1,5 | 2,0 |
| interjection | 1 | 0 | 0 | 1,0 | -1,0 |
| verb | 5 | 0 | 0 | 1,0 | -1,0 |
| noun | 348 | 24 | 52 | 1,08 | 2,17 |
| pronoun | 1 | 0 | 0 | 1,0 | -1,0 |
| adverb | 13 | 2 | 4 | 1,13 | 2,0 |
| numeral | 72 | 0 | 0 | 1,0 | -1,0 |
| conjunction | 3 | 0 | 0 | 1,0 | -1,0 |
Esperanto entries [edit]
Number of words with unknown POS: 108
Number of words and senses [edit]
Rows in the table: 19
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 2 | 2 | article | 1 | la 1, l' 1 |
| 3 | 3 | expression | 1 | tiel 1, aŭ 1, tiom 1 |
| 525 | 550 | proper noun | 2 | Marso 2, Patro Kristnasko 2, Eŭropo 2 |
| 56 | 56 | letter | 1 | I 1, a 1, C 1 |
| 48 | 69 | preposition | 4 | dum 4, kun 4, antaŭ 3 |
| 20 | 24 | prefix | 3 | pra- 3, eks- 2, i- 1 |
| 37 | 40 | determiner | 3 | ĉia 3, tia 1, ties 1 |
| 108 | 134 | suffix | 3 | -a 3, -ad- 3, -ant- 3 |
| 1503 | 1619 | adjective | 3 | agrabla 3, sama 3, malafabla 3 |
| 15 | 15 | correlative | 1 | alial 1, aliam 1, aliel 1 |
| 26 | 29 | interjection | 2 | ĉaŭ 2, fek 2, gesinjoroj 2 |
| 1412 | 1571 | verb | 4 | zorgi 4, bari 3, debati 3 |
| 6010 | 6453 | noun | 6 | punkto 6, batilo 4, edzo 4 |
| 10 | 15 | particle | 3 | ne 3, ĉi 3, ĉu 2 |
| 61 | 70 | pronoun | 3 | kiun 3, vi 2, kiu 2 |
| 46 | 47 | phrase | 2 | mi petas 2, mi malsatas 1, mi soifas 1 |
| 108 | 120 | numeral | 2 | ses 2, sescent 2, sesdek 2 |
| 596 | 647 | adverb | 3 | nepre 3, tiel 3, okaze 3 |
| 18 | 20 | conjunction | 2 | ankoraŭ 2, ĉar 2, plus 1 |
Polysemy information [edit]
Rows in the table: 19
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| article | 2 | 0 | 0 | 1,0 | -1,0 |
| expression | 3 | 0 | 0 | 1,0 | -1,0 |
| proper noun | 500 | 25 | 50 | 1,05 | 2,0 |
| letter | 56 | 0 | 0 | 1,0 | -1,0 |
| preposition | 34 | 14 | 35 | 1,44 | 2,5 |
| prefix | 17 | 3 | 7 | 1,2 | 2,33 |
| determiner | 35 | 2 | 5 | 1,08 | 2,5 |
| suffix | 89 | 19 | 45 | 1,24 | 2,37 |
| adjective | 1397 | 106 | 222 | 1,08 | 2,09 |
| correlative | 15 | 0 | 0 | 1,0 | -1,0 |
| interjection | 23 | 3 | 6 | 1,12 | 2,0 |
| verb | 1269 | 143 | 302 | 1,11 | 2,11 |
| noun | 5617 | 393 | 836 | 1,07 | 2,13 |
| particle | 7 | 3 | 8 | 1,5 | 2,67 |
| pronoun | 53 | 8 | 17 | 1,15 | 2,12 |
| phrase | 45 | 1 | 2 | 1,02 | 2,0 |
| numeral | 96 | 12 | 24 | 1,11 | 2,0 |
| adverb | 551 | 45 | 96 | 1,09 | 2,13 |
| conjunction | 16 | 2 | 4 | 1,11 | 2,0 |
Latin entries [edit]
Number of words with unknown POS: 78
Number of words and senses [edit]
Rows in the table: 24
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 6 | 14 | gerund | 4 | laborandum 4, definiendum 3, sufflaminandum 2 |
| 459 | 509 | proper noun | 3 | Africa 3, Uranus 3, Beelzebub 3 |
| 15 | 15 | letter | 1 | a 1, C 1, e 1 |
| 47 | 93 | preposition | 7 | pro 7, sub 5, absque 3 |
| 25 | 33 | prefix | 6 | a- 6, co- 2, ne- 1 |
| 1 | 2 | infix | 2 | -n- 2 |
| 6 | 10 | determiner | 3 | ille 3, ambo 2, idem 1 |
| 3933 | 6104 | adjective | 10 | raptus 10, liquidus 7, alienus 6 |
| 65 | 76 | suffix | 3 | -ve 3, -icus 3, -arius 2 |
| 38 | 42 | interjection | 2 | o 2, hem 2, hui 2 |
| 4887 | 10872 | verb | 30 | agar 30, iaceto 28, iacuerimus 28 |
| 3 | 4 | particle | 2 | -ne 2, non 1 |
| 7122 | 12076 | noun | 16 | manus 16, caput 9, linea 9 |
| 27 | 28 | proverb | 2 | obsta principiis 2, albus an ater sit 1, asinus in tegulis 1 |
| 11 | 14 | idiom | 3 | nil desperandum 3, bellum gerere 1, ceterum censeo 1 |
| 1 | 1 | prepositional phrase | 1 | a posteriori 1 |
| 113 | 147 | pronoun | 4 | chodchod 4, sese 4, memet 3 |
| 4 | 4 | symbol | 1 | Ↄ 1, Ⅎ 1, ↄ 1 |
| 103 | 113 | numeral | 3 | secundum 3, sexagesima 3, sexagesimum 3 |
| 1 | 1 | interfix | 1 | -o- 1 |
| 932 | 1384 | adverb | 7 | ultro 7, continenter 5, tanquam 5 |
| 137 | 160 | phrase | 3 | Socratici viri 3, gratia gratiam parit 3, in medio 3 |
| 64 | 103 | conjunction | 5 | nam 5, enim 4, nec 4 |
| 4869 | 7618 | participle | 14 | iacens 14, iaciturus 14, productus 9 |
Polysemy information [edit]
Rows in the table: 24
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| gerund | 2 | 4 | 12 | 2,33 | 3,0 |
| proper noun | 414 | 45 | 95 | 1,11 | 2,11 |
| letter | 15 | 0 | 0 | 1,0 | -1,0 |
| preposition | 25 | 22 | 68 | 1,98 | 3,09 |
| prefix | 22 | 3 | 11 | 1,32 | 3,67 |
| infix | 0 | 1 | 2 | 2,0 | 2,0 |
| determiner | 3 | 3 | 7 | 1,67 | 2,33 |
| adjective | 2503 | 1430 | 3601 | 1,55 | 2,52 |
| suffix | 56 | 9 | 20 | 1,17 | 2,22 |
| interjection | 34 | 4 | 8 | 1,11 | 2,0 |
| verb | 2114 | 2773 | 8758 | 2,22 | 3,16 |
| particle | 2 | 1 | 2 | 1,33 | 2,0 |
| noun | 4107 | 3015 | 7969 | 1,7 | 2,64 |
| proverb | 26 | 1 | 2 | 1,04 | 2,0 |
| idiom | 9 | 2 | 5 | 1,27 | 2,5 |
| prepositional phrase | 1 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 86 | 27 | 61 | 1,3 | 2,26 |
| symbol | 4 | 0 | 0 | 1,0 | -1,0 |
| numeral | 97 | 6 | 16 | 1,1 | 2,67 |
| interfix | 1 | 0 | 0 | 1,0 | -1,0 |
| adverb | 613 | 319 | 771 | 1,48 | 2,42 |
| phrase | 120 | 17 | 40 | 1,17 | 2,35 |
| conjunction | 40 | 24 | 63 | 1,61 | 2,62 |
| participle | 3168 | 1701 | 4450 | 1,56 | 2,62 |
Italian entries [edit]
Number of words with unknown POS: 165
Number of words and senses [edit]
Rows in the table: 19
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 11 | 11 | article | 1 | le 1, i 1, lo 1 |
| 2288 | 2588 | proper noun | 4 | Parma 4, Bodoni 3, Ragusa 3 |
| 26 | 26 | letter | 1 | I 1, C 1, X 1 |
| 301 | 360 | preposition | 9 | di 9, per 6, su 5 |
| 411 | 455 | prefix | 4 | filo- 4, ana- 3, ceno- 3 |
| 1 | 1 | affix | 1 | -un- 1 |
| 19655 | 23531 | adjective | 10 | minore 10, maggiore 9, arcigno 6 |
| 317 | 379 | suffix | 5 | -ata 5, -ite 5, -ato 3 |
| 167 | 212 | interjection | 4 | cazzo 4, buon giorno 4, diavolo 3 |
| 33565 | 39455 | verb | 14 | puntare 14, sbattere 9, spuntare 9 |
| 44182 | 55602 | noun | 11 | titolo 11, manomissione 10, manopola 10 |
| 4 | 4 | idiom | 1 | la quiete prima della tempesta 1, il gatto ti ha mangiato la lingua 1, pollice verde 1 |
| 23 | 24 | proverb | 2 | l'abito non fa il monaco 2, Roma non fu fatta in un giorno 1, meglio un uovo oggi che una gallina domani 1 |
| 157 | 214 | pronoun | 4 | ci 4, qualcuno 4, il quale 4 |
| 2 | 2 | numeral | 1 | otto 1, uno 1 |
| 65 | 70 | phrase | 2 | alle calende greche 2, che figata 2, detto, fatto 2 |
| 3 | 3 | symbol | 1 | × 1, Z 1, W 1 |
| 2945 | 3586 | adverb | 5 | dolcemente 5, molto 4, al contrario 4 |
| 125 | 151 | conjunction | 3 | se 3, se non 3, con 2 |
Polysemy information [edit]
Rows in the table: 19
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| article | 11 | 0 | 0 | 1,0 | -1,0 |
| proper noun | 1999 | 289 | 589 | 1,13 | 2,04 |
| letter | 26 | 0 | 0 | 1,0 | -1,0 |
| preposition | 274 | 27 | 86 | 1,2 | 3,19 |
| prefix | 373 | 38 | 82 | 1,11 | 2,16 |
| affix | 1 | 0 | 0 | 1,0 | -1,0 |
| adjective | 16610 | 3045 | 6921 | 1,2 | 2,27 |
| suffix | 268 | 49 | 111 | 1,2 | 2,27 |
| interjection | 131 | 36 | 81 | 1,27 | 2,25 |
| verb | 29321 | 4244 | 10134 | 1,18 | 2,39 |
| noun | 35739 | 8443 | 19863 | 1,26 | 2,35 |
| idiom | 4 | 0 | 0 | 1,0 | -1,0 |
| proverb | 22 | 1 | 2 | 1,04 | 2,0 |
| pronoun | 112 | 45 | 102 | 1,36 | 2,27 |
| numeral | 2 | 0 | 0 | 1,0 | -1,0 |
| phrase | 60 | 5 | 10 | 1,08 | 2,0 |
| symbol | 3 | 0 | 0 | 1,0 | -1,0 |
| adverb | 2426 | 519 | 1160 | 1,22 | 2,24 |
| conjunction | 101 | 24 | 50 | 1,21 | 2,08 |
Swedish entries [edit]
Number of words with unknown POS: 316
Number of words and senses [edit]
Rows in the table: 22
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 5 | 5 | article | 1 | de 1, en 1, den 1 |
| 1644 | 1763 | proper noun | 4 | Paris 4, Europa 3, Josua 3 |
| 11 | 11 | letter | 1 | e 1, o 1, delta 1 |
| 80 | 123 | preposition | 5 | om 5, på 5, runt 5 |
| 8 | 8 | acronym | 1 | ABF 1, EU 1, FN 1 |
| 1 | 1 | postposition | 1 | ut 1 |
| 42 | 49 | prefix | 3 | över- 3, e- 2, för- 2 |
| 11 | 12 | determiner | 2 | var 2, de där 1, dessa 1 |
| 2086 | 2546 | adjective | 8 | hård 8, öppen 8, rå 6 |
| 80 | 111 | suffix | 5 | -e 5, -t 5, -en 4 |
| 104 | 119 | interjection | 3 | god fortsättning 3, varsågod 3, punkt 2 |
| 2435 | 3542 | verb | 23 | gå 23, hålla 14, lägga 13 |
| 2 | 2 | particle | 1 | om 1, att 1 |
| 9050 | 11458 | noun | 11 | bryt 11, rot 9, tunga 9 |
| 26 | 26 | proverb | 1 | Gå inte över ån efter vatten 1, Rom byggdes inte på en dag 1, nära skjuter ingen hare 1 |
| 54 | 54 | idiom | 1 | det vete gudarna 1, bakom flötet 1, bli tagen på sängen 1 |
| 128 | 152 | pronoun | 4 | vilken 4, det 3, som 2 |
| 64 | 65 | phrase | 2 | sida upp och sida ned 2, gott nytt år 1, jag är rädd för det 1 |
| 1 | 1 | interfix | 1 | -s- 1 |
| 682 | 791 | adverb | 7 | tillgodo 7, så 5, precis 5 |
| 166 | 167 | numeral | 2 | artonhundra 2, tionde 1, elfte 1 |
| 47 | 58 | conjunction | 6 | och 6, emellertid 3, då 2 |
Polysemy information [edit]
Rows in the table: 22
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| article | 5 | 0 | 0 | 1,0 | -1,0 |
| proper noun | 1532 | 112 | 231 | 1,07 | 2,06 |
| letter | 11 | 0 | 0 | 1,0 | -1,0 |
| preposition | 60 | 20 | 63 | 1,54 | 3,15 |
| acronym | 8 | 0 | 0 | 1,0 | -1,0 |
| postposition | 1 | 0 | 0 | 1,0 | -1,0 |
| prefix | 36 | 6 | 13 | 1,17 | 2,17 |
| determiner | 10 | 1 | 2 | 1,09 | 2,0 |
| adjective | 1778 | 308 | 768 | 1,22 | 2,49 |
| suffix | 62 | 18 | 49 | 1,39 | 2,72 |
| interjection | 91 | 13 | 28 | 1,14 | 2,15 |
| verb | 1786 | 649 | 1756 | 1,45 | 2,71 |
| particle | 2 | 0 | 0 | 1,0 | -1,0 |
| noun | 7547 | 1503 | 3911 | 1,27 | 2,6 |
| proverb | 26 | 0 | 0 | 1,0 | -1,0 |
| idiom | 54 | 0 | 0 | 1,0 | -1,0 |
| pronoun | 108 | 20 | 44 | 1,19 | 2,2 |
| phrase | 63 | 1 | 2 | 1,02 | 2,0 |
| interfix | 1 | 0 | 0 | 1,0 | -1,0 |
| adverb | 607 | 75 | 184 | 1,16 | 2,45 |
| numeral | 165 | 1 | 2 | 1,01 | 2,0 |
| conjunction | 41 | 6 | 17 | 1,23 | 2,83 |
Spanish entries [edit]
Number of words with unknown POS: 153
Number of words and senses [edit]
Rows in the table: 23
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 7 | 7 | article | 1 | lo 1, la 1, el 1 |
| 2 | 2 | expression | 1 | o 1, había 1 |
| 1395 | 1577 | proper noun | 4 | África 4, Mérida 4, Jacobo 4 |
| 62 | 63 | letter | 2 | Ñ 2, a 1, x 1 |
| 46 | 72 | preposition | 5 | en 5, por 5, según 3 |
| 5 | 6 | acronym | 2 | PAN 2, SAG 1, SAMU 1 |
| 41 | 44 | prefix | 3 | re- 3, bis- 2, in- 1 |
| 1 | 1 | affix | 1 | -un- 1 |
| 3 | 3 | determiner | 1 | cada 1, cierto 1, uno 1 |
| 4878 | 6043 | adjective | 8 | duro 8, chocho 6, pegado 6 |
| 129 | 198 | suffix | 12 | -ón 12, -ón 7, -azo 4 |
| 134 | 162 | interjection | 4 | ojalá 4, hombre 3, alá 3 |
| 5647 | 7652 | verb | 18 | dar 18, curar 14, tirar 14 |
| 14616 | 18894 | noun | 15 | medio 15, casco 13, mano 10 |
| 5 | 5 | idiom | 1 | algo del otro mundo 1, nada del otro mundo 1, con mal pie 1 |
| 15 | 17 | proverb | 2 | El hábito no hace al monje 2, del dicho al hecho hay mucho trecho 2, a todo cerdo le llega su san Martín 1 |
| 115 | 154 | pronoun | 4 | se 4, les 4, me 3 |
| 117 | 133 | phrase | 4 | a la buena de Dios 4, al fin y al cabo 4, de pe a pa 4 |
| 56 | 57 | numeral | 2 | sesenta 2, quince 1, once 1 |
| 839 | 965 | adverb | 9 | ya 9, encima 5, al revés 4 |
| 5 | 7 | symbol | 2 | @ 2, ¡ 2, ⸘ 1 |
| 1 | 2 | participle | 2 | amado 2 |
| 36 | 55 | conjunction | 6 | que 6, no obstante 4, sino 3 |
Polysemy information [edit]
Rows in the table: 23
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| article | 7 | 0 | 0 | 1,0 | -1,0 |
| expression | 2 | 0 | 0 | 1,0 | -1,0 |
| proper noun | 1234 | 161 | 343 | 1,13 | 2,13 |
| letter | 61 | 1 | 2 | 1,02 | 2,0 |
| preposition | 32 | 14 | 40 | 1,57 | 2,86 |
| acronym | 4 | 1 | 2 | 1,2 | 2,0 |
| prefix | 39 | 2 | 5 | 1,07 | 2,5 |
| affix | 1 | 0 | 0 | 1,0 | -1,0 |
| determiner | 3 | 0 | 0 | 1,0 | -1,0 |
| adjective | 4031 | 847 | 2012 | 1,24 | 2,38 |
| suffix | 94 | 35 | 104 | 1,53 | 2,97 |
| interjection | 113 | 21 | 49 | 1,21 | 2,33 |
| verb | 4493 | 1154 | 3159 | 1,36 | 2,74 |
| noun | 11813 | 2803 | 7081 | 1,29 | 2,53 |
| idiom | 5 | 0 | 0 | 1,0 | -1,0 |
| proverb | 13 | 2 | 4 | 1,13 | 2,0 |
| pronoun | 92 | 23 | 62 | 1,34 | 2,7 |
| phrase | 107 | 10 | 26 | 1,14 | 2,6 |
| numeral | 55 | 1 | 2 | 1,02 | 2,0 |
| adverb | 748 | 91 | 217 | 1,15 | 2,38 |
| symbol | 3 | 2 | 4 | 1,4 | 2,0 |
| participle | 0 | 1 | 2 | 2,0 | 2,0 |
| conjunction | 26 | 10 | 29 | 1,53 | 2,9 |
Mandarin entries [edit]
Number of words with unknown POS: 2922
Number of words and senses [edit]
Rows in the table: 27
| Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
|---|---|---|---|---|
| 2 | 4 | article | 2 | 一个 2, 一個 2 |
| 46 | 52 | preposition | 3 | 下 3, chúle...yǐwài 2, yú 2 |
| 2 | 2 | syllable | 1 | 的 1, 吧 1 |
| 24 | 25 | postposition | 2 | 一下 2, 以后 1, 以前 1 |
| 122 | 138 | suffix | 4 | -cài 4, -fa 3, -fǎ 3 |
| 200 | 221 | interjection | 3 | 萬歲 3, 万岁 3, 好說 3 |
| 24 | 30 | particle | 4 | d 4, di 1, の 1 |
| 446 | 481 | proverb | 4 | 三十年河东,三十年河西 4, 三十年河東,三十年河西 4, 一日為師,終身為父 3 |
| 1830 | 1985 | idiom | 4 | 不上不下 4, 进退两难 3, biéchūjīzhù 3 |
| 126 | 161 | pronoun | 6 | tāmen 6, tā 6, 怎麼 3 |
| 1393 | 25022 | pinyin | 239 | yì 239, yù 164, zī 142 |
| 33 | 35 | measure word | 2 | 条 2, shēng 2, bàn 1 |
| 3184 | 3319 | proper noun | 4 | 山東 4, 山东 4, 陶斯 4 |
| 11 | 11 | letter | 1 | ㄅ 1, ㄆ 1, ㄇ 1 |
| 20211 | 20378 | hanzi | 11 | 重 11, 书 10, 托 7 |
| 20 | 25 | prefix | 3 | bái- 3, 超 2, 子 2 |
| 1511 | 27001 | pinyin syllable | 239 | yi4 239, yu4 164, zhi4 133 |
| 211 | 295 | affix | 9 | fǎ 9, yuán 8, zhōng 8 |
| 8 | 8 | determiner | 1 | 幾個 1, 几个 1, 更多 1 |
| 2393 | 2743 | adjective | 9 | qīng 9, dà 5, 暧昧 5 |
| 6690 | 7497 | verb | 14 | 下 14, 收 7, 走火 6 |
| 16397 | 18466 | noun | 12 | 帅 12, 烟 7, 六书 7 |
| 235 | 242 | phrase | 2 | youyiyi 2, zhùxué 2, zěnme bàn 2 |
| 962 | 1061 | adverb | 5 | 丁丁 5, 偏偏 5, 丁当 3 |
| 30 | 32 | numeral | 2 | bàn 2, 万 2, 千萬 1 |
| 4 | 4 | symbol | 1 | 囧 1, ; 1, { 1 |
| 159 | 168 | conjunction | 3 | 如 3, 就此 3, 除非 2 |
Polysemy information [edit]
Rows in the table: 27
| POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
|---|---|---|---|---|---|
| article | 0 | 2 | 4 | 2,0 | 2,0 |
| preposition | 41 | 5 | 11 | 1,13 | 2,2 |
| syllable | 2 | 0 | 0 | 1,0 | -1,0 |
| postposition | 23 | 1 | 2 | 1,04 | 2,0 |
| suffix | 111 | 11 | 27 | 1,13 | 2,45 |
| interjection | 183 | 17 | 38 | 1,11 | 2,24 |
| particle | 21 | 3 | 9 | 1,25 | 3,0 |
| proverb | 420 | 26 | 61 | 1,08 | 2,35 |
| idiom | 1701 | 129 | 284 | 1,08 | 2,2 |
| pronoun | 107 | 19 | 54 | 1,28 | 2,84 |
| pinyin | 168 | 1225 | 24854 | 17,96 | 20,29 |
| measure word | 31 | 2 | 4 | 1,06 | 2,0 |
| proper noun | 3072 | 112 | 247 | 1,04 | 2,21 |
| letter | 11 | 0 | 0 | 1,0 | -1,0 |
| hanzi | 20124 | 87 | 254 | 1,01 | 2,92 |
| prefix | 16 | 4 | 9 | 1,25 | 2,25 |
| pinyin syllable | 208 | 1303 | 26793 | 17,87 | 20,56 |
| affix | 174 | 37 | 121 | 1,4 | 3,27 |
| determiner | 8 | 0 | 0 | 1,0 | -1,0 |
| adjective | 2124 | 269 | 619 | 1,15 | 2,3 |
| verb | 5998 | 692 | 1499 | 1,12 | 2,17 |
| noun | 14731 | 1666 | 3735 | 1,13 | 2,24 |
| phrase | 228 | 7 | 14 | 1,03 | 2,0 |
| adverb | 879 | 83 | 182 | 1,1 | 2,19 |
| numeral | 28 | 2 | 4 | 1,07 | 2,0 |
| symbol | 4 | 0 | 0 | 1,0 | -1,0 |
| conjunction | 152 | 7 | 16 | 1,06 | 2,29 |
References [edit]
- ^ This (or more recent) database would be available at the project site wikokit, see Download section.