Module talk:hi-translit

Definition from Wiktionary, the free dictionary
Jump to: navigation, search

Module testing[edit]

Testing मानक (mānak) हिन्दी (hindī): [DIRECT MODULE CALL REDACTED], expected: "mānak hindī". --Anatoli (обсудить/вклад) 02:14, 24 April 2013 (UTC)

Testing conjuncts: [DIRECT MODULE CALL REDACTED] --Anatoli (обсудить/вклад) 02:25, 24 April 2013 (UTC)

Testing some text:

Original: हिन्दी संवैधानिक रूप से भारत की प्रथम राजभाषा और भारत की सबसे अधिक बोली और समझी जानेवाली भाषा है। चीनी के बाद यह विश्व में सबसे अधिक बोली जाने वाली भाषा भी है।

हिन्दी और इसकी बोलियाँ उत्तर एवं मध्य भारत के विविध राज्यों में बोली जाती हैं । भारत और अन्य देशों में में ६० करोड़ से अधिक लोग हिन्दी बोलते, पढ़ते और लिखते हैं। फ़िजी, मॉरिशस, गयाना, सूरीनाम की अधिकतर और नेपाल की कुछ जनता हिन्दी बोलती है।

हिन्दी राष्ट्रभाषा, राजभाषा, सम्पर्क भाषा, जनभाषा के सोपानों को पार कर विश्वभाषा बनने की ओर अग्रसर है। भाषा विकास क्षेत्र से जुड़े वैज्ञानिकों की भविष्यवाणी हिन्दी प्रेमियों के लिए बड़ी सन्तोषजनक है कि आने वाले समय में विश्वस्तर पर अन्तर्राष्ट्रीय महत्त्व की जो चन्द भाषाएँ होंगी उनमें हिन्दी भी प्रमुख होगी।:

Translit: [DIRECT MODULE CALL REDACTED]. --Anatoli (обсудить/вклад) 02:36, 24 April 2013 (UTC)

To do[edit]

  1. Remove the inherent vowel (a) from consonants before diacritics, virama and at the end of words (or default them to be without "a" and add it when necessary).
  2. Try to implement the rules of dropping the inherent a.

Should the default consonants be without "a"? --Anatoli (обсудить/вклад) 02:42, 24 April 2013 (UTC)

Not if the same system is to be used for Sanskrit. If you do remove the inherent vowel for Hindi, at least make a separate Module:sa-translit where it isn't removed. —Angr 12:00, 22 June 2013 (UTC)
Fair enough but I'm currently not working on this. I don't know how to do and if it's possible. --Anatoli (обсудить/вклад) 12:09, 22 June 2013 (UTC)


This should transliterated to "shr" as per w:Hunterian transliteration. I don't want to break anything, so can anyone add it? Aryamanarora (talk) 19:52, 27 October 2015 (UTC)

We don't use Hunterian transliteration but a variety of IAST, so "śra" is correct. See also WT:HI TR. --Anatoli T. (обсудить/вклад) 22:40, 27 October 2015 (UTC)
Ahh, okay, thanks for the answer :) Aryamanarora (talk) 00:40, 28 October 2015 (UTC)

Double width tilde[edit]

There's a problem with the combining double-width tilde (U+0360) that's used on the vowels ऐं a͠i and औं a͠u; the combining character is being inserted before the letters when the correct codepoint placement for a double-width combining character is between the two letters. This means it appears straddling the a and the letter before it. (I suspect that the person who made the mistake was using a font that misrendered combining characters too far to the right, causing them to put it in the wrong place). 08:30, 12 May 2016 (UTC)

@Wyang I noticed this as well. —Aryamanarora (मुझसे बात करो) 14:48, 12 May 2016 (UTC)
I changed the placement of the tilde to between the letters. There was probably aesthetic concern by the person who originally suggested placing the tilde before the first letter (compare ha͠i and h͠ai). Wyang (talk) 09:21, 15 May 2016 (UTC)

ज्ञ (gy) transliteration[edit]

@Atitarev, Wyang Wouldn't it be more accurate (and helpful) to transliterate this as gy? —Aryamanarora (मुझसे बात करो) 19:27, 9 April 2017 (UTC)

I don't have a strong opinion on this. I like IAST but we don't follow it 100% for Hindi.--Anatoli T. (обсудить/вклад) 22:01, 9 April 2017 (UTC)
@Aryamanarora It seems to be listed in the translit as gy, yet it still transliterates as . How is that the case? — AWESOME meeos * ([nʲɪ‿bʲɪ.spɐˈko.ɪtʲ]) 22:54, 9 April 2017 (UTC)
@Awesomemeeos I kind of understand Lua, and it seems to me this line text = gsub(text, 'ज्ञ', conv) at the end overrides that somehow? —Aryamanarora (मुझसे बात करो) 22:55, 9 April 2017 (UTC)
@Aryamanarora I think you're right about that... but the problem is whether we should keep the cluster untouched... You should turn the cluster as a comment until further notice.
Since this module very much generates a transcription rather than a transliteration, I think gy would be better. But then, this module isn't entirely transcriptive either... Wyang (talk) 09:59, 10 April 2017 (UTC)
@Aryamanarora, Wyang, DerekWinters Hi. I have formed my opinion now on the transliteration of ज्ञ. I vote for preserving "jñ" as in the Oxford Hindi-English dictionary, →ISBN. ज्ञान (gyān) is transliterated "jñān" (as the current default). Our current Hindi transliteration is quite phonetic, otherwise. I actually prefer we use this dictionary for determining how to deal with shwa-dropping as well for ambiguous cases. --Anatoli T. (обсудить/вклад) 12:12, 11 April 2017 (UTC)


@Aryamanarora The Unicode Standard says this character is specific to Marathi [1] :

Independent vowel for Marathi

I could imagine being used for Hindi, but have you seen it in actual use for Hindi? There may be (possibly prescriptive) rules regarding when to use ॲ, ऑ, but it appears there are many ways people spell a single English word in Devanagari so those rules aren't always applied.

Also, are Marathi modules still reliant on Module:hi-translit, since it says

It is also used to transliterate Marathi (mr), Awadhi (awa), Haryanvi (bgc), Braj (bra) and Garhwali (gbm).

and because the Marathi modules are still in development?

Kutchkutch (talk) 21:00, 8 November 2017 (UTC)

@Kutchkutch: Yes, the "also used to transliterate" list is always up to date, because it is generated by a module that searches for the languages that use this transliteration module. (That is, the languages that have "hi-translit" in their language data file. Marathi's data file is in Module:languages/data2.) — Eru·tuon 23:35, 8 November 2017 (UTC)
Hi @Erutuon:. Thanks for the detailed answer!
I've noticed that you've contributed to Marathi phonology.
I was thinking of making a Module:typing-aids/data/omr based on Module:typing-aids/data/psu for typing in the Modi script, but as Aryamanarora correctly pointed out, all the available Modi fonts do not work properly so there's no point until Noto fonts supports it. Kutchkutch (talk) 00:45, 9 November 2017 (UTC)
@Kutchkutch: Sorry, forgot to reply to this. The only reason I added the character was because Marathi translit was relying on this. I think though MOD:mr-translit is reasonably functioning, so I've switched Marathi to rely on that. I will remove ॲ from this now, since Hindi (or any of the closely related Hindi belt languages that use this module) do not use the chandra a. —Aryaman (मुझसे बात करो) 01:30, 9 November 2017 (UTC)
@Aryamanarora: Thanks for clarifying! If the Marathi's reliance on MOD:hi-translit is removed then any future change to MOD:hi-translit that affects MOD:mr-translit will need to copied manually.
There's always a chance Bombay/Bambaiya Hindi might use through close contact with Marathi, but that would require evidence of repeated usage and not just one instance. Kutchkutch (talk) 01:53, 9 November 2017 (UTC)
@Kutchkutch: No problem! I'm not sure about Bambaiya, since it's so hard to find it written down. —Aryaman (मुझसे बात करो) 02:04, 9 November 2017 (UTC)