Wiktionary:Languages
| This is a Wiktionary policy, guideline or common practices page. Specifically it is a policy think tank, working to develop a formal policy. | |
| Policies: CFI - ELE - BLOCK - REDIR - BOTS - QUOTE - DELETE - NPOV - AXX |
According to our Criteria for inclusion, Wiktionary is intended to include all words in all languages.
Languages may be differentiated through their names, or by codes formed by a few letters.
See Wiktionary:Dialects and Wiktionary:Families for discussions of dialects and of language families, respectively.
Contents |
[edit] Language codes
Language codes are found in topical categories, language-exclusive templates, etymologies, among other pages. There are also situations where language names are used instead.
Each language code is stored in a different template, the name being the code itself. When the template is called, the result is the language name, which may or not be automatically linked to the related Wiktionary entry. For example, when called, the template {{vot}} returns Votic. All language code templates are at Category:Language templates. The space of three letter templates is reserved, with a few exceptions, for language code templates as they do change through time. .
[edit] Code assignment
- Wiktionary language codes come primarily from ISO 639-1, a series of two-letter codes which covers 136 major languages. For other languages, we use ISO 639-3, a series of three-letter codes which grew out of the Ethnologue codes and covers thousands of languages. When, according to our criteria for inclusion, a language is constructed and merits to be defined only in appendices, its template is prefixed with conl:. When a language is a reconstructed ancestor of all languages of a family, its template is prefixed with proto:.
- Examples:
- en is stored at {{en}} and represents English.
- fa is stored at {{fa}} and represents Persian.
- ang is stored at {{ang}} and represents Old English.
- cmn is stored at {{cmn}} and represents Mandarin.
- sjn is stored at {{conl:sjn}} and represents Sindarin.
- gem=pro is stored at {{proto:gem-pro}} and represents Proto-Germanic.
- Examples:
- An exceptional language code may simply come from ISO 639-2, from deprecated ISO 639-1, from deprecated ISO 639-3 or from the Wikimedia language codes.
- Otherwise, the exceptional language code must start with a related ISO 639-5 code. Then, it must be followed by - (an ordinary hyphen) and an extension formed by a few lower case letters. (Therefore, no digits, upper case letters, etc; IANA tags allow these, case independent, but Mediawiki software is more restrictive.)
- Any specific code derived from community consensus and not directly from ISO, is an exceptional code.
[edit] Finding the code for a language
If you don't know the code for a language, you can look it up at Wiktionary:Index to templates/languages. That page lists all possible language codes used on Wiktionary, along with their names. The page is very long, however, and it may be very slow to load on less powerful computers. It is also not possible to automate it in this way using templates.
For that reason the template {{langrev}} can be used instead. It takes one parameter, which is the name of the language, and it returns the code if there is one, or nothing if it can't find any code for that language. So, for example, {{langrev|English}} becomes en, which is the language code for English. To use it, you can either include it on a page and preview or submit it (preferably in the Sandbox), or you can go to Special:ExpandTemplates and type it in there.
[edit] Language names
Language names are found in entry translations, lexical categories, appendices, rhymes, among other pages. There are also situations where language codes are used instead.
- All language names should be defined as English words in their respective Wiktionary entries.
- Each language is consistently referred by only one name, to be used in translations, etymologies and elsewhere. When there are two or more possible names, one of them is chosen.
- The name is chosen by consensus. Common guidelines are to avoid abbreviations, the words "Modern" and "Standard", diacritics and parentheses when possible.
- Multiple names of languages should also eventually be listed in the sections synonyms or alternative forms at each entry.
- Language names from language codes may be linked, to become easier to find information about these languages as Wiktionary entries. Most of them are linked by default, except for those listed at Wiktionary:Translations/Wikification.
[edit] List of languages
[edit] List of languages with exceptional codes
| Name | Wikipedia article | Wiktionary code | Comments |
|---|---|---|---|
| !Kung | w:!Kung language | khi-kun {{khi-kun}} |
|
| Ammonite | w:Ammonite language | sem-amm {{sem-amm}} | |
| Banyumasan | w:Banyumasan language | map-bms {{map-bms}} | |
| Bunurong | w:Bunurong language | aus-bun {{aus-bun}} | |
| Chinese | w:Chinese language | zhx-zho {{zhx-zho}} |
|
| Darkinjung | w:Darkinjung language | aus-dar {{aus-dar}} | |
| Dutch Low Saxon | w:Dutch Low Saxon | nds-nl {{nds-nl}} |
|
| Gabi | w:Pama-Nyungan languages#Classification and Languages | aus-gab {{aus-gab}} | |
| Gallo | w:Gallo language | roa-gal {{roa-gal}} |
|
| Gaulish | w:Gaulish language | cel-gau {{cel-gau}} | |
| Greenlandic Eskimo Pidgin | w:Indigenous languages of the Americas#Pidgins, mixed languages and trade languages | crp-gep {{crp-gep}} | |
| Guernésiais | w:Guernésiais | roa-grn {{roa-grn}} |
|
| Gunai | w:Gunai language | aus-gun {{aus-gun}} |
|
| Gutnish | w:Modern Gutnish | gmq-gut {{gmq-gut}} |
|
| Jèrriais | w:Jèrriais | roa-jer {{roa-jer}} |
|
| Leonese | w:Leonese language | roa-leo {{roa-leo}} | |
| Mandarin | w:Mandarin Chinese | zh {{zh}} |
|
| Maroon Spirit Language | w:Jamaican Maroon Spirit Possession Language | cpe-mar {{cpe-mar}} | |
| Middle Chinese | w:Middle Chinese | zhx-mid {{zhx-mid}} | |
| Middle Norwegian | w:Norwegian language#From Old Norse to distinct Scandinavian languages | gmq-mno {{gmq-mno}} | |
| Mingo | w:Mingo | iro-min {{iro-min}} | |
| Moldavian | w:Moldovan language | mo {{mo}} |
|
| Montenegrin | w:Montenegrin language | zls-mon {{zls-mon}} |
|
| Nahuatl | w:Nahuatl | nah {{nah}} |
|
| Norfuk | w:Norfuk language | cpe-nor {{cpe-nor}} |
|
| Norman | w:Norman language | roa-nor {{roa-nor}} |
|
| Old Danish | w:Old Danish | gmq-oda {{gmq-oda}} | |
| Old Novgorod dialect | w:Old Novgorod dialect | zle-nov {{zle-nov}} | |
| Old Polish | w:Old Polish language | zlw-opl {{zlw-opl}} | |
| Old Portuguese | w:Galician Portuguese | roa-ptg {{roa-ptg}} | |
| Old Swedish | w:Swedish language#Old Swedish | gmq-osw {{gmq-osw}} | |
| Phuthi | w:Phuthi language | bnt-phu {{bnt-phu}} | |
| Picuris | w:Picuris language | nai-pic {{nai-pic}} | |
| Pitkern | w:Pitkern | cpe-pit {{cpe-pit}} |
|
| Pomeranian | w:Pomeranian language | zlw-pom {{zlw-pom}} | |
| Russenorsk | w:Russenorsk | crp-rsn {{crp-rsn}} | |
| Samoan Plantation Pidgin | w:Samoan Plantation Pidgin | cpe-spp {{cpe-spp}} | |
| Serbo-Croatian | w:Serbo-Croatian language | sh {{sh}} |
|
| Slovincian | w:Slovincian | zlw-slv {{zlw-slv}} |
|
| Sydney | w:Sydney language | aus-syd {{aus-syd}} |
|
| Syrian Arabic | w:Syrian Arabic | sem-syr {{sem-syr}} | |
| Taimyr Pidgin Russian | crp-tpr {{crp-tpr}} | ||
| Tarantino | w:Tarantino language | roa-tar {{roa-tar}} | |
| Wemba-Wemba | w:Wemba-Wemba | aus-wem {{aus-wem}} | |
| Woiwurrung | w:Woiwurrung language | aus-wwg {{aus-wwg}} | |
| Zamboanga Chavacano | w:Chavacano language#Zamboangueño | cbk-zam {{cbk-zam}} |
|
[edit] List of appendix-only constructed languages
| Name | Wikipedia article | Wiktionary code | Comments |
|---|---|---|---|
| A Programming Language | w:APL (programming language) | art-apl {{conl:art-apl}} | |
| Eloi language | w:Eloi language | art-elo {{conl:art-elo}} | |
| Hyper Text Markup Language | w:HTML | art-html {{conl:art-html}} | |
| Klingon | w:Klingon language | tlh {{conl:tlh}} | |
| Láadan | w:Láadan | ldn {{conl:ldn}} | |
| Lapine | w:Lapine language | art-lap {{conl:art-lap}} | |
| Mandalorian | w:Mandalorian#Language | art-man {{conl:art-man}} | |
| Mundolinco | w:Mundolinco | art-mun {{conl:art-mun}} | |
| Na'vi | w:Na'vi language | art-nav {{conl:art-nav}} | |
| Neo | w:Neo (constructed language) | art-neo {{conl:art-neo}} | |
| Quenya | w:Quenya | qya {{conl:qya}} | |
| Sindarin | w:Sindarin | sjn {{conl:sjn}} | |
| Toki Pona | w:Toki Pona | art-top {{conl:art-top}} |