User:Neitrāls vārds/sandbox

From Wiktionary, the free dictionary
Jump to navigation Jump to search

Possible test cases[edit]

  • ッチ – tci
  • ットゥ – ttu
  • ゥイ – uy
  • クッコㇿウシ – kutkor'usi (the apostrophe is debatable, perhaps should be ignored)
  • ンモ – mmo
  • サッポロ – Satporo (should be entered with a period サッ.ポロ to have small tsu behave like t and not gemination sign.)
  • ア=エホッケ – a=ehotke
  • ウイユㇷ゚ケ – uiyupke (V + i = Vy unless following starts with y)
  • ニウロッケ – niwrotke
  • ラッチャコ – ratcako

V + u = Vw

Provisional pairs and conditions[edit]

  • Basic correspondences (adapted from ja module not known if all kana is relevant in ain, she/sha, etc. removed because sh/s sounds appear universally orthographically interchangeable. Same for tse, tsa, etc. appear to occur only on morphemic boundary t + se, t + sa, etc. Many others possibly could be removed (l-row, v-row, etc.) The only real difference is shi = si and chi, chu, cha = ci, cu, ca, etc.
["ア"] = "a", ["イ"] = "i", ["イィ"] = "yi", ["イェ"] = "ye", ["ウ"] = "u", ["ヴ"] = "vu", ["ヴァ"] = "va", ["ヴィ"] = "vi", ["ヴィェ"] = "vye", ["ヴェ"] = "ve", ["ヴォ"] = "vo", ["ヴャ"] = "vya", ["ヴュ"] = "vyu", ["ヴョ"] = "vyo", ["ウァ"] = "wa", ["ウィ"] = "wi", ["ウゥ"] = "wu", ["ウェ"] = "we", ["ウォ"] = "wo", ["ウュ"] = "wyu", ["エ"] = "e", ["オ"] = "o", ["カ"] = "ka", ["キ"] = "ki", ["キェ"] = "kye", ["キャ"] = "kya", ["キュ"] = "kyu", ["キョ"] = "kyo", ["ガ"] = "ga", ["ギ"] = "gi", ["ギェ"] = "gye", ["ギャ"] = "gya", ["ギュ"] = "gyu", ["ギョ"] = "gyo", ["ク"] = "ku", ["クァ"] = "kwa", ["クィ"] = "kwi", ["クェ"] = "kwe", ["クォ"] = "kwo", ["クヮ"] = "kwa", ["グ"] = "gu", ["グァ"] = "gwa", ["グィ"] = "gwi", ["グェ"] = "gwe", ["グォ"] = "gwo", ["グヮ"] = "gwa", ["ケ"] = "ke", ["ゲ"] = "ge", ["コ"] = "ko", ["ゴ"] = "go", ["サ"] = "sa", ["ザ"] = "za", ["シ"] = "si", ["ジ"] = "ji", ["ジェ"] = "je", ["ジャ"] = "ja", ["ジュ"] = "ju", ["ジョ"] = "jo", ["ス"] = "su", ["スィ"] = "si", ["ズ"] = "zu", ["ズィ"] = "zi", ["セ"] = "se", ["ゼ"] = "ze", ["ソ"] = "so", ["ゾ"] = "zo", ["タ"] = "ta", ["ダ"] = "da", ["チ"] = "ci", ["チェ"] = "ce", ["チャ"] = "ca", ["チュ"] = "cu", ["チョ"] = "co", ["ヂ"] = "ji", ["ヂャ"] = "ja", ["ヂュ"] = "ju", ["ヂョ"] = "jo", ["ヅ"] = "zu", ["テ"] = "te", ["ティ"] = "ti", ["テュ"] = "tyu", ["デ"] = "de", ["ディ"] = "di", ["デュ"] = "dyu", ["ト"] = "to", ["トゥ"] = "tu", ["ド"] = "do", ["ドゥ"] = "du", ["ナ"] = "na", ["ニ"] = "ni", ["ニェ"] = "nye", ["ニャ"] = "nya", ["ニュ"] = "nyu", ["ニョ"] = "nyo", ["ヌ"] = "nu", ["ネ"] = "ne", ["ノ"] = "no", ["ハ"] = "ha", ["バ"] = "ba", ["パ"] = "pa", ["ヒ"] = "hi", ["ビ"] = "bi", ["ピ"] = "pi", ["ヒェ"] = "hye", ["ヒャ"] = "hya", ["ヒュ"] = "hyu", ["ヒョ"] = "hyo", ["ビェ"] = "bye", ["ピェ"] = "pye", ["ビャ"] = "bya", ["ピャ"] = "pya", ["ビュ"] = "byu", ["ピュ"] = "pyu", ["ビョ"] = "byo", ["ピョ"] = "pyo", ["フ"] = "fu", ["ファ"] = "fa", ["フィ"] = "fi", ["ブィ"] = "bi", ["フィェ"] = "fye", ["フェ"] = "fe", ["フォ"] = "fo", ["フャ"] = "fya", ["フュ"] = "fyu", ["フョ"] = "fyo", ["ブ"] = "bu", ["プ"] = "pu", ["ヘ"] = "he", ["ベ"] = "be", ["ペ"] = "pe", ["ホ"] = "ho", ["ボ"] = "bo", ["ポ"] = "po", ["ホゥ"] = "hu", ["マ"] = "ma", ["ミ"] = "mi", ["ミェ"] = "mye", ["ミャ"] = "mya", ["ミュ"] = "myu", ["ミョ"] = "myo", ["ム"] = "mu", ["メ"] = "me", ["モ"] = "mo", ["ヤ"] = "ya", ["ユ"] = "yu", ["ヨ"] = "yo", ["ラ"] = "ra", ["リ"] = "ri", ["ラ゜"] = "la", ["リ゜"] = "li", ["リェ"] = "rye", ["リャ"] = "rya", ["リュ"] = "ryu", ["リョ"] = "ryo", ["ル"] = "ru", ["ル゜"] = "lu", ["レ゜"] = "le", ["レ"] = "re", ["ロ"] = "ro", ["ロ゜"] = "lo", ["ワ"] = "wa", ["ヷ"] = "va", ["ヰ"] = "wi", ["ヸ"] = "vi", ["ヱ"] = "we", ["ヹ"] = "ve", ["ヲ"] = "o", ["ヺ"] = "vo", ["ン"] = "n@"
  • Small kana correspondences (small-tsu is a special case.) Entered as entities of Hex codes listed on this page, so hopefully correct.
["ㇰ"] = "k", ["ㇱ"] = "s", ["ㇲ"] = "s", ["ㇳ"] = "t", ["ㇴ"] = "n", ["ㇵ"] = "h", ["ㇶ"] = "h", ["ㇷ"] = "h", ["ㇸ"] = "h", ["ㇹ"] = "h", ["ㇺ"] = "m", ["ㇻ"] = "r", ["ㇼ"] = "r", ["ㇽ"] = "r", ["ㇾ"] = "r", ["ㇿ"] = "r"
  • Uncertain: when a small kana is followed by a vowel, e.g., r + u, TDA indicates a glottal stop(?) with an apostrophe = r'u, this additional sign has not been seen on Ainu Times, for example. Perhaps should be ignored until further confirmation.
  • Small tsu is a t under the following circumstances: nothing follows it (it's the last character in a word), a period . follows it (additional sign to block gemination) or a space follows it. When something else follows it (likely a consonant-row kana and there's no period in between,) it behaves exactly as in ja – gemination sign.
["ッ"] = "t"
    • Based on the above any periods entered need to be stripped from output.
  • Gemination, as mentioned, identical to the rules in ja module, including ci – geminate tci (like ja chi – tchi)
  • If an i follows a vowel it becomes y – a + i + nu = aynu.
    • Exception: when something starting with y follows i following a vowel, no change takes place – a + i + yu = aiyu (not ayyu)
  • When an u follows a vowel it becomes w – i + u = iw.
    • Uncertain: perhaps like with i-y it would be reversed if something starting with w followed, e.g., i + u + wa would be iuwa not iwwa but it is uncertain if there actually exist any words of this pattern.
  • This is somewhat uncertain but, TDA shows this in their orthography: when an n (it was defined as n@ in the pairs) is followed by m, b or p it becomes m.