User:-sche/missing codes

Definition from Wiktionary, the free dictionary
Jump to: navigation, search

ISO 639 includes 7865 codes; this is a list of those codes which are not present in Wiktionary. Some are intentionally excluded by us or merged by us into other codes, but others are missing by simple oversight, or in some cases because naming conflicts years ago prevented importation.

For a list of codes which the ISO has retired but which Wiktionary still includes, see User:-sche/retired codes. (Items also listed there and in the BP should be struck here.)

For a list of codes which are present in Wiktionary but which should not be, see User:-sche/notlanguages. Also note w:Spurious languages: we currently include and encode some lects which WP thinks may not exist, including Yiddish Sign Language, Parsi 'prp' and Parsi-Dari 'prd'.

See also Wiktionary:Beer parlour/2015/January#Codes_the_ISO_deleted_or_added_in_2014.

Languages to possibly rename or merge[edit]

Sections with Chins not yet having family info, etc: T, V, W.

Languages to possible rename or merge:

  1. Antigua and Barbuda Creole English Antigua and Barbuda Creole English
  2. Judeo-Tunisian Arabic Judeo-Tunisian Arabic
  3. Judeo-Moroccan Arabic Judeo-Moroccan Arabic
  4. Guerrero Amuzgo Guerrero Amuzgo
  5. Eastern Egyptian Bedawi Arabic Eastern Egyptian Bedawi Arabic
  6. Arua Aruá (Amazonas State)
  7. Aruá Aruá (Rodonia State)
  8. Okpela Ivbie North-Okpela-Arhe
  9. Anus Anus
  10. Austral Austral
  1. Remontado Agta Remontado Dumagat
    (is this a distinct language? vide b.g.c. "Sinauna")
  1. Emberá-Baudó Emberá-Baudó
  2. Indonesian Bajau Indonesian Bajau
  3. West Coast Bajau West Coast Bajau
  4. Balkan Gagauz Turkish Balkan Gagauz Turkish
  5. Bohtan Neo-Aramaic Bohtan Neo-Aramaic
  6. Vaghat-Ya-Bijim-Legeri Vaghat-Ya-Bijim-Legeri
  7. Biao-Jiao Mien Biao-Jiao Mien
  1. Iriga Bicolano Rinconada Bikol
  2. Bura Bura-Pabir
  3. Bafaw-Balong Bafaw-Balong
  4. Bu-Nao Bunu Bu-Nao Bunu
  5. Belize Kriol English Belize Kriol English
  6. Hainyaxo Bozo Kɛlɛngaxo Bozo
  1. Cutchi-Swahili Cutchi-Swahili
  2. Malaccan Creole Malay Malaccan Creole Malay
  3. Cinda-Regi-Tiyal Cinda-Regi-Tiyal
  4. Rukiga Chiga
  5. Chocangacakha Chocangacakha
  6. Chinook Jargon Chinook jargon
  7. Chipewyan Chipewyan
  8. Cakchiquel-Quiché Mixed Language Cakchiquel-Quiché Mixed Language
  9. Central Mnong Central Mnong
  10. Chichonyi-Chidzihana-Chikauma Chonyi-Dzihana-Kauma
  11. Ucayali-Yurúa Ashéninka Ucayali-Yurúa Ashéninka
  12. Ajyíninka Apurucayali Ajyíninka Apurucayali
  13. Cappadocian Greek Cappadocian Greek
  14. Chinese Pidgin English Chinese Pidgin English
  15. Pichis Ashéninka Pichis Ashéninka
  16. South Ucayali Ashéninka South Ucayali Ashéninka
  17. Chuanqiandian Cluster Miao Chuanqiandian Cluster Miao
  18. Cora El Nayar Cora
  19. Seychellois Creole Seselwa Creole French
  20. Knaanic Knaanic
check names
  • Gciriku; — I; L; Diriku;
other concerns
  • Pidgin Delaware — Pidgin Delaware
  • Hindi Dogri — Dogri (individual language)
  • Dogri — Dogri (macrolanguage)
  • Northwestern Dinka — Northwestern Dinka
  • Dixon Reef — Dixon Reef
  • Dar Daju Daju — Dar Daju Daju
  • Dong — Dong
  • Doura — Toura (Papua New Guinea)
  • Fang (Cameroon) — Fang (Cameroon)
  • Fang (Guinea) — Fang (Equatorial Guinea)
  • Flinders Island — Flinders Island


  • gpe ; — I; L; Ghanaian Pidgin English;
    Very similar to {{pcm}} and {{kri}}, I doubt that this is an independent lect —Μετάknowledgediscuss/deeds 18:21, 11 February 2013 (UTC)
Hm, but it would be awkward to merge it into (and call it) Nigerian Pidgin. Perhaps we should create an exceptional code for "West African Pidgin", and merge {{gpe}} and {{pcm}} into it? - -sche (discuss) 23:06, 11 February 2013 (UTC)
That's what I was thinking, but I don't know enough about them to feel comfortable making such a big step. —Μετάknowledgediscuss/deeds 23:13, 11 February 2013 (UTC)
Template:gmz (didn't get caught? hmm...) — Mgbolizhia - also called the Mgbo language


  • ihw — Bidhawal - a dialect of/with unn
  • jan — Jandai
  • jbi — Badjiri - possibly not even Karnic; cf my notes about ekc above and on User:-sche/retired codes
  • jbk (Barikewa) and jmw (Mouwase) — varieties of {{mgx}} Omati/Mini, said to be quite divergent from each other: but we should either have mgx or have jbk+jmw, not all three
  • jbw — Yawijibaya
  • jgk — Gwak
  • jjr — Bankal
  • jms — Mashi (Nigeria)
  • jog — Jogi
  • jui — Ngadjuri


  • nmx — I; L; Nama (Papua New Guinea); - Nama (New Guinea) - distinguished from naq "Nama", a dialect of nza "Tigon Mbembe", a Jukunoid language of Cameroon and Nigeria
  • npg — I; L; Ponyo-Gongwang Naga;
  • nqy — I; L; Akyaung Ari Naga;
  • nsf — I; L; Northwestern Nisu;
  • ntx — I; L; Tangkhul Naga (Myanmar); — Tangkhul Naga (Myanmar) / (Burma) (only so named to distinguish it from another 'Naga'...?)
  • nwg — I; E; Ngayawung;
  • nxk — I; L; Koki Naga;
  • oke — I; L; Okpe (Southwestern Edo);
  • okx — I; L; Okpe (Northwestern Edo); - find a better name ... "Southwestern Edoid Okpe" and "Northwestern Edoid Okpe"?
  • olk — I; E; Olkol;
  • orc — I; L; Orma;


  • sfe; — I; L; Eastern Subanen; -
  • sgj; — I; L; Surgujia; - Suraji, Surguja, Surgujia-Chhattisgarhi, Surjugia
  • sng; — I; L; Sanga (Democratic Republic of Congo);
  • sox; — I; L; Swo; -
  • spb; — I; L; Sepa (Indonesia);


  • tcl — Taman (Myanmar) / (Burma) - (extinct?) - This is a variety, like 'zkd' and 'tcl', of the language Sak. Which region name to use if we decide to include the lect at all is a hard question.
  • tgj ; — I; L; Tagin; -
  • tgz ; — I; E; Tagalaka; -
  • tjl ; — I; L; Tai Laing; -
  • tmn — Taman (Indonesia) - (living?) - OK but for the naming conflict
  • tst ; — I; L; Tondi Songway Kiini; -
other concerns
  • Tobagonian Creole English; — I; L; Tobagonian Creole English; - merge?
    Turks And Caicos Creole English; — I; L; Turks And Caicos Creole English; - merge with other similar Creole Englishes from that area?


  • xsn ; — I; L; Sanga (Nigeria);
  • xyy ; — I; L; Yorta Yorta;
  • yhs — Yan-nhaŋu Sign Language - signed by 10 people, not that distinct from ygs
  • yxy — Yabula Yabula - (extinct) - ?
other concerns
we have a lot of Zapotec varieties; someone could check if any need to be merged


gjr, fnb, dtn

special cases (split codes, etc)[edit]


Gbaya is the name of quite a few languages. 'krs', a Gbaya of Sudan, is able to be called "Kresh". But there is also 'gba', the Gbaya family(!), many of the members of which are called Gbaya. See w:Gbaya languages. 'gba' may need to be split.

sorted by name[edit]


Template:sng — Sanga (Democratic Republic of the Congo) - what's the betting that they just have this confused with another Sanga? See Citations:Sanga. —Μετάknowledgediscuss/deeds 02:43, 12 February 2013 (UTC)
It wouldn't be the first time they mixed up languages' names. It's interesting to consider the ways in which SIL/ISO/Ethnologue are similar to us Wiktionarians; they're fallible people attempting an enormous, inherently messy task (identifying every language in the world). They seem to have done OK here, though: "Luba-Sanga" ("southern Luba") exists, and it makes sense that it would sometimes be called bare "Sanga", as they suggest. And Luba-Sanga is in a different family than either Sango or Numana-Nunku-Gbantu-Numbu, which would make confusion harder / more impressive. We should call it "Luba-Sanga" rather than "Sanga (Democratic Republic of Congo)", though, just to do what we can to avoid confusion. - -sche (discuss) 06:06, 13 February 2013 (UTC)
Template:xsn — Sanga (Nigeria) - yet another Sanga, and probably a product of the confusion that can be seen at Citations:Sanga and with Template:sng, Template:sg, and Template:nbr. —Μετάknowledgediscuss/deeds 19:52, 12 February 2013 (UTC)
Gah. The Sangas are still not quite as numerous as the Karas, but they're trying to be. - -sche (discuss) 06:34, 13 February 2013 (UTC)

codes to (possibly) delete[edit]

  • Template:mbo, Mbo (Cameroon), is part of a dialect cluster with 'bss' – Akoose, 'bqz' – Kaka (Central Mbo), and 'bsi' – Sosi
  • White and Green Hmong could be merged into e.g. "Laotian Hmong". This could either use one of those lects' codes, or have an exceptional code, e.g. hmn-wgn (White+Green/Njua).

existing codes to (possibly) rename[edit]

  • Indonesian Bajau (bdl) – Sulawesi [Bajaw]
  • West Coast Bajau (bdr) – Sabah West Coast [Bajaw]
  • Mapun (sjm) – Mapun [Bajaw]

langrev functionality[edit]

If you know a language's code (for example, "en") and you want to find out its canonical name, you can use this:

  • {{#invoke:languages/templates|getByCode|en|getCanonicalName}} (returns "English")

If you know a language's canonical name (for example, "English") and you want to find out its code, use this:

  • {{#invoke:languages/templates|getByCanonicalName|English}} (returns "en")

All of these functions are subst:able (type {{subst:#invoke:...).

codes to merge[edit]

  • Merge 'wof' "Gambian Wolof" into 'wo' "Wolof", or rename the latter "Senegalese Wolof"?