This user is currently working on
adding English, Turkish, Arabic, Persian and Kurdish names of Turkish and Iranian provinces and Syrian and Iraqi governorates, all sourced from local Wikipedias and contained in this sheet.
Here's the thing; I may be a native speaker of Arabic, but I might struggle with MSA at times ^^' Plus, I'm re-learning the language so I can be fluent in it again
Semitic Lemmas
| Language |
Lemma Pages (19/04/2026)
|
| Arabic |
21,936 |
|
| Egyptian Arabic |
903 |
|
| North Levantine Arabic |
670 |
|
| South Levantine Arabic |
2,894 |
|
| Iraqi Arabic |
174 |
|
| North Mesopotamian Arabic |
8 |
|
| Cypriot Arabic |
414 |
|
| Hijazi Arabic |
1,781 |
|
| Najdi Arabic |
40 |
|
| Gulf Arabic |
649 |
|
| Baharna Arabic |
4 |
|
| Omani Arabic |
9 |
|
| Yemeni Arabic |
12 |
|
| Sudanese Arabic |
44 |
|
| Juba Arabic |
110 |
|
| Libyan Arabic |
167 |
|
| Tunisian Arabic |
71 |
|
| Algerian Arabic |
38 |
|
| Andalusian Arabic |
30 |
|
| Moroccan Arabic |
1,757 |
|
| Hassaniya Arabic |
7
|
| Chadian Arabic |
205 |
|
| Maltese |
11,441 |
|
| Hebrew |
12,320 |
|
| Aramaic |
1,850 |
|
| Classical Syriac |
2,622 |
|
| Assyrian Neo-Aramaic |
5,498 |
|
| Turoyo |
207 |
|
| Mlahsö |
1 |
|
| Hértevin |
1 |
|
| Lishana Deni |
94 |
|
| Harsusi |
3 |
|
| Bathari |
1 |
|
| Shehri |
5 |
|
| Hobyót |
5 |
|
| Mehri |
25 |
|
| Soqotri |
35 |
|
| Amharic |
1,876 |
|
| Tigrinya |
868 |
|
| Tigre |
277 |
|
| Ge'ez |
496 |
|
/Semitic resources
Turkic Lemmas
| Language |
Lemma Pages (19/04/2026)
|
| Turkish |
26,433 |
|
| Khorasani Turkish |
9 |
|
| Ottoman Turkish |
7,998 |
|
| Gagauz |
924 |
|
| Azerbaijani |
11,128 |
|
| Turkmen |
1,774 |
|
| Salar |
1,069 |
|
| Crimean Tatar |
4,559 |
|
| Karaim |
359 |
|
| Urum |
61 |
|
| Krymchak |
10 |
|
| Karachay-Balkar |
161 |
|
| Kumyk |
1,390 |
|
| Tatar |
1,852 |
|
| Bashkir |
2,852 |
|
| Nogai |
464 |
|
| Kazakh |
11,260 |
|
| Karakalpak |
117 |
|
| Siberian Tatar |
14 |
|
| Kyrgyz |
2,654 |
|
| Southern Altai |
1,723 |
|
| Uzbek |
3,576 |
|
| Uyghur |
3,710 |
|
| Äynu |
5 |
|
| Ili Turki |
10 |
|
| Northern Altai |
1,205 |
|
| Khakas |
408 |
|
| Tofa |
41 |
|
| Tuvan |
773 |
|
| Shor |
226 |
|
| Dolgan |
296 |
|
| Yakut |
3,038 |
|
| Chuvash |
801 |
|
Germanic Lemmas
| Language |
Lemma Pages (19/04/2026)
|
| English |
859,604 |
|
| Scots |
4,239 |
|
| Dutch |
65,051 |
|
| German |
102,822 |
|
| Yiddish |
9,992 |
|
| Swedish |
60,410 |
|
| Danish |
23,711 |
|
| Norwegian Bokmål |
21,554 |
|
| Norwegian Nynorsk |
24,342 |
|
| Faroese |
7,259 |
|
| Icelandic |
18,798 |
|
Romance Lemmas
| Language |
Lemma Pages (19/04/2026)
|
| Latin |
44,933 |
|
| Portuguese |
73,261 |
|
| Galician |
17,377 |
|
| Mirandese |
479 |
|
| Asturian |
7,151 |
|
| Extremaduran |
169 |
|
| Spanish |
114,811 |
|
| Aragonese |
1,369 |
|
| Catalan |
32,777 |
|
| Occitan |
5,012 |
|
| French |
97,581 |
|
| Gallo |
206 |
|
| Walloon |
2,552 |
|
| Romansh |
2,206 |
|
| Ladin |
1,494 |
|
| Venetan |
2,652 |
|
| Corsican |
825 |
|
| Italian |
128,746 |
|
| Neapolitan |
1,339 |
|
| Sicilian |
2,679 |
|
| Sardinian |
1,139 |
|
| Aromanian |
3,983 |
|
| Romanian |
112,838 |
|
Balto-Slavic Lemmas
| Language |
Lemma Pages (19/04/2026)
|
| Latvian |
12,021 |
|
| Latgalian |
541 |
|
| Lithuanian |
8,223 |
|
| Samogitian |
158 |
|
| Russian |
60,435 |
|
| Belarusian |
5,643 |
|
| Ukrainian |
29,575 |
|
| Polish |
99,481 |
|
| Kashubian |
2,735 |
|
| Silesian |
2,408 |
|
| Lower Sorbian |
1,773 |
|
| Upper Sorbian |
947 |
|
| Czech |
49,450 |
|
| Slovak |
12,262 |
|
| Slovene |
5,819 |
|
| Serbo-Croatian |
57,697 |
|
| Macedonian |
45,999 |
|
| Bulgarian |
17,628 |
|
Indo-Iranian Lemmas
| Language |
Lemma Pages (19/04/2026)
|
| Ossetian |
863 |
|
| Zazaki |
1,056 |
|
| Northern Kurdish |
3,270 |
|
| Central Kurdish |
1,229 |
|
| Southern Kurdish |
125 |
|
| Persian |
14,986 |
|
| Tajik |
3,040 |
|
| Talysh |
296 |
|
| Gilaki |
32 |
|
| Mazanderani |
306 |
|
| Pashto |
1,555 |
|
| Baluchi |
404 |
|
| Sanskrit |
9,952 |
|
| Sindhi |
1,503 |
|
| Gawar-Bati |
826 |
|
| Ushojo |
2,948 |
|
| Kashmiri |
1,742 |
|
| Punjabi |
6,308 |
|
| Saraiki |
371 |
|
| Hindi |
21,659 |
|
| Urdu |
7,656 |
|
| Gujarati |
6,694 |
|
| Nepali |
1,994 |
|
| Assamese |
3,935 |
|
| Bengali |
9,539 |
|
| Odia |
1,913 |
|
| Marathi |
4,551 |
|
| Sinhalese |
1,153 |
|
| Dhivehi |
1,867 |
|
Austronesian Lemmas
| Language |
Lemma Pages (19/04/2026)
|
| Tagalog |
30,529 |
|
| Cebuano |
14,964 |
|
| Ilocano |
1,515 |
|
| Central Bikol |
6,542 |
|
| Cuyunon |
79 |
|
| Ibaloi |
328 |
|
| Kankanaey |
821 |
|
| Kayapa Kallahan |
54 |
|
| Limos Kalinga |
479 |
|
| Sambali |
522 |
|
| Waray-Waray |
720 |
|
| Malay |
9,952 |
|
| Indonesian |
31,276 |
|
| Javanese |
3,234 |
|
| Sundanese |
3,086 |
|
| Māori |
2,412 |
|
| Hawaiian |
3,056 |
|
| Malagasy |
3,634 |
|
Non-Semitic Lemmas
| Language |
Lemma Pages (19/04/2026)
|
| Armenian |
18,929 |
|
| Greek |
32,641 |
|
| Albanian |
13,370 |
|
| Basque |
4,884 |
|
| Chinese |
303,582
|
| Burmese |
8,142 |
|
| Thai |
16,916 |
|
| Lao |
2,342 |
|
| Khmer |
9,382 |
|
| Vietnamese |
43,776 |
|
| Swahili |
12,518 |
|
| Afrikaans |
6,229 |
|
| Zulu |
2,863 |
|
| Xhosa |
3,127 |
|
/sandbox (Linking this so I can access it much easier. Nobody else should edit it)
(Non-Semitic) Resources :3
- Shabdkosh for Bengali, Gujarati, Hindi, Kannada, Konkani, Malayalam, Marathi, Nepali, Odia, Punjabi, Sanskrit, Tamil, Telugu and Urdu
- A Sinhalese-English dictionary by Charles Csrter for Sinhala
- Jisho for Japanese
General Resources :3
- Almaaany (Arabic to English, French, German, Indonesian, Persian, Portuguese, Russian, Spanish, Turkish and Urdu)
- Almaaany (English to Bulgarian, Chinese, Croatian, Dutch, French, German, Greek, Hebrew, Hindi, Hungarian, Italian, Japanese, Korean, Portuguese, Romanian, Russian, Spanish, Swedish and Turkish)