Wiktionary:About Marshallese
| This is a Wiktionary policy, guideline or common practices page. This is a draft proposal. It is unofficial, and it is unknown whether it is widely accepted by Wiktionary editors. | |
| Policies: CFI - ELE - BLOCK - REDIR - BOTS - QUOTE - DELETE - NPOV - AXX |
Contents |
Alphabet issues [edit]
There are currently display problem issues with five Marshallese letters:
| Description | Letter | Unicode | Issue |
|---|---|---|---|
| L-cedilla | Ļ, ļ | U+013B, U+013C | Most fonts display this letter with a comma-below diacritic instead of a cedilla, to accommodate the expectations of the Latvian alphabet. |
| M-cedilla | M̧, m̧ | U+004D-U+0327, U+006D-U+0327 | Not encoded as single glyph, and as such requires a combining diacritic that does not display or align properly in most fonts. When displayed properly, the cedilla is placed either beneath the middle of the letter or underneath the rightmost column of the letter (but not too far to the right). |
| N-cedilla | Ņ, ņ | U+0145, U+0146 | Most fonts display this letter with a comma-below diacritic instead of a cedilla, to accommodate the expectations of the Latvian alphabet. |
| N-macron | N̄, n̄ | U+004E-U+0304, U+006E-U+0304 | Not encoded as single glyph, and as such requires a combining diacritic that does not display or align properly in most fonts. |
| O-cedilla | O̧, o̧ | U+004F-U+0327, U+006F-U+0327 | Not encoded as single glyph, and as such requires a combining diacritic that does not display or align properly in most fonts. |
The characters given here are only approximations of the actual characters that are used in careful typesetting, but they are conditionally used until a better solution is found. Note especially that the sequences involving the combining macron character (U+304) or the combining cedilla character (U+0327) will not display correctly in the majority of fonts.
Only the standard diacritics are used in Wiktionary entries for Marshallese. Alternative schemes (particularly the Ḷ ḷ Ṃ ṃ Ṇ ṇ Ñ ñ Ọ ọ alternatives promoted by the Marshallese-English Dictionary) are not used. Three other letters with diacritics, Ā ā Ō ō Ū ū, are already well-displayed in most modern default browser fonts; alternative forms à ã Ä ä Õ õ Ö ö Ũ ũ Ü ü are not used.
Compare:
- Arial: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Arial Unicode MS: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Calibri: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Cambria: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Candara: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Charis SIL: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Code2000: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Consolas: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Constantia: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Corbel: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Courier New: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Doulos SIL: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Gentium: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Gentium Plus: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Lucida Sans Unicode: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Segoe UI: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Tahoma: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
- Times New Roman: Ā ā Ļ ļ M̧ m̧ Ņ ņ N̄ n̄ O̧ o̧ Ō ō Ū ū
Pronunciation template [edit]
Marshallese pronunciations are embedded using a special template, {{mh-ipa-rows}}. For example:
{{mh-ipa-rows|mh|hah|ah|J|yeh|lh}}
embeds:
- MED Phonemes: {m̧ahjeļ}
- IPA Phonemes: /mˠaɰtʲɜlˠ/
- IPA Articulation: [mˠɑɑ̯zʲɛ͡ʌɫ]
The template describes both phonology and articulation, with each of the phonetic transcriptions accepting or ignoring certain details. See Marshallese phonology on Wikipedia for details on Marshallese phonotactics. For each code argument:
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Entry collation [edit]
When sorting entries for categorization, a simple ASCII-based transcription can properly collate Marshallese entries in Marshallese word categories. Use all lowercase for sorting, and include spaces and dashes as normally included in the entry. And for letters with diacritics:
| ā | ļ | m̧ | ņ | n̄ | o̧ | ō | ū |
| a~ | l~ | m~ | n~ | n~~ | o~ | o~~ | u~ |
So, a word like M̧ajōļ would be collated m~ajo~~l~.
Marshallese-English Dictionary reference template [edit]
The Marshallese-English Dictionary is the only complete Marshallese dictionary in existence, and has one significant online location. The template {{meod-ref}} links to that location, and the template can be updated in case that location changes. The template can accept up to five arguments, each a separate reference. For each reference, only the URL substring immediately following MOD/ need be provided.