Module talk:hi-translit

Definition from Wiktionary, the free dictionary
Jump to: navigation, search

Module testing[edit]

Testing मानक (mānak) हिन्दी (hindī): [DIRECT MODULE CALL REDACTED], expected: "mānak hindī". --Anatoli (обсудить/вклад) 02:14, 24 April 2013 (UTC)

Testing conjuncts: [DIRECT MODULE CALL REDACTED] --Anatoli (обсудить/вклад) 02:25, 24 April 2013 (UTC)

Testing some text:

Original: हिन्दी संवैधानिक रूप से भारत की प्रथम राजभाषा और भारत की सबसे अधिक बोली और समझी जानेवाली भाषा है। चीनी के बाद यह विश्व में सबसे अधिक बोली जाने वाली भाषा भी है।

हिन्दी और इसकी बोलियाँ उत्तर एवं मध्य भारत के विविध राज्यों में बोली जाती हैं । भारत और अन्य देशों में में ६० करोड़ से अधिक लोग हिन्दी बोलते, पढ़ते और लिखते हैं। फ़िजी, मॉरिशस, गयाना, सूरीनाम की अधिकतर और नेपाल की कुछ जनता हिन्दी बोलती है।

हिन्दी राष्ट्रभाषा, राजभाषा, सम्पर्क भाषा, जनभाषा के सोपानों को पार कर विश्वभाषा बनने की ओर अग्रसर है। भाषा विकास क्षेत्र से जुड़े वैज्ञानिकों की भविष्यवाणी हिन्दी प्रेमियों के लिए बड़ी सन्तोषजनक है कि आने वाले समय में विश्वस्तर पर अन्तर्राष्ट्रीय महत्त्व की जो चन्द भाषाएँ होंगी उनमें हिन्दी भी प्रमुख होगी।:

Translit: [DIRECT MODULE CALL REDACTED]. --Anatoli (обсудить/вклад) 02:36, 24 April 2013 (UTC)

To do[edit]

  1. Remove the inherent vowel (a) from consonants before diacritics, virama and at the end of words (or default them to be without "a" and add it when necessary).
  2. Try to implement the rules of dropping the inherent a.

Should the default consonants be without "a"? --Anatoli (обсудить/вклад) 02:42, 24 April 2013 (UTC)

Not if the same system is to be used for Sanskrit. If you do remove the inherent vowel for Hindi, at least make a separate Module:sa-translit where it isn't removed. —Angr 12:00, 22 June 2013 (UTC)
Fair enough but I'm currently not working on this. I don't know how to do and if it's possible. --Anatoli (обсудить/вклад) 12:09, 22 June 2013 (UTC)


This should transliterated to "shr" as per w:Hunterian transliteration. I don't want to break anything, so can anyone add it? Aryamanarora (talk) 19:52, 27 October 2015 (UTC)

We don't use Hunterian transliteration but a variety of IAST, so "śra" is correct. See also WT:HI TR. --Anatoli T. (обсудить/вклад) 22:40, 27 October 2015 (UTC)
Ahh, okay, thanks for the answer :) Aryamanarora (talk) 00:40, 28 October 2015 (UTC)

Double width tilde[edit]

There's a problem with the combining double-width tilde (U+0360) that's used on the vowels ऐं a͠i and औं a͠u; the combining character is being inserted before the letters when the correct codepoint placement for a double-width combining character is between the two letters. This means it appears straddling the a and the letter before it. (I suspect that the person who made the mistake was using a font that misrendered combining characters too far to the right, causing them to put it in the wrong place). 08:30, 12 May 2016 (UTC)

@Wyang I noticed this as well. —Aryamanarora (मुझसे बात करो) 14:48, 12 May 2016 (UTC)
I changed the placement of the tilde to between the letters. There was probably aesthetic concern by the person who originally suggested placing the tilde before the first letter (compare ha͠i and h͠ai). Wyang (talk) 09:21, 15 May 2016 (UTC)

ज्ञ () transliteration[edit]

@Atitarev, Wyang Wouldn't it be more accurate (and helpful) to transliterate this as gy? —Aryamanarora (मुझसे बात करो) 19:27, 9 April 2017 (UTC)

I don't have a strong opinion on this. I like IAST but we don't follow it 100% for Hindi.--Anatoli T. (обсудить/вклад) 22:01, 9 April 2017 (UTC)
@Aryamanarora It seems to be listed in the translit as gy, yet it still transliterates as . How is that the case? — AWESOME meeos * ([nʲɪ‿bʲɪ.spɐˈko.ɪtʲ]) 22:54, 9 April 2017 (UTC)
@Awesomemeeos I kind of understand Lua, and it seems to me this line text = gsub(text, 'ज्ञ', conv) at the end overrides that somehow? —Aryamanarora (मुझसे बात करो) 22:55, 9 April 2017 (UTC)
@Aryamanarora I think you're right about that... but the problem is whether we should keep the cluster untouched... You should turn the cluster as a comment until further notice.
Since this module very much generates a transcription rather than a transliteration, I think gy would be better. But then, this module isn't entirely transcriptive either... Wyang (talk) 09:59, 10 April 2017 (UTC)
@Aryamanarora, Wyang, DerekWinters Hi. I have formed my opinion now on the transliteration of ज्ञ. I vote for preserving "jñ" as in the Oxford Hindi-English dictionary, ISBN 9780198643395. ज्ञान (jñān) is transliterated "jñān" (as the current default). Our current Hindi transliteration is quite phonetic, otherwise. I actually prefer we use this dictionary for determining how to deal with shwa-dropping as well for ambiguous cases. --Anatoli T. (обсудить/вклад) 12:12, 11 April 2017 (UTC)