Template talk:ja-pron

From Wiktionary, the free dictionary
(Redirected from Module talk:ja-pron)
Jump to navigation Jump to search

Initial discussion[edit]

Discussion moved from Talk:精神分裂病.


Having done Module:ko-pron, I'd like to start the work on Module:ja-pron now. Wyang (talk) 03:19, 16 April 2014 (UTC)[reply]
@Eirikr. Might as well move this discussion to a more visible place. ;) --Anatoli (обсудить/вклад) 03:23, 16 April 2014 (UTC)[reply]
  • @Eirikr OK. I'll think of a different location. If you have a way to contact Haplology, please let him know we need him here! Take it easy and come back when you can.
  • @Wyang. I'm happy to do the testing and using the future module but I'd need to brush up my IPA for Japanese. We'll have to rely on your skills and knowledge again. :) がんばってね!--Anatoli (обсудить/вклад) 06:17, 16 April 2014 (UTC)[reply]


Thanks. I've done a crude version at Template:ja-pron and Module:ja-pron. Currently,

{{ja-pron|せいしん ぶんれつ びょう|acc=h|y=on}}

generates

Lua error in Module:ja-pron at line 77: Parameter "y" is not used by this template.

Any suggestions? Wyang (talk) 07:52, 16 April 2014 (UTC)[reply]

Well done! No suggestions yet but some documentation would be helpful, specifically on parameters and types of accents. It's for 標準語, isn't it or for any variety? Should the template link to/mention the variety name? --Anatoli (обсудить/вклад) 08:01, 16 April 2014 (UTC)[reply]
Can we also have Accent: 0, Accent: 1, 'etc. next to accent names? --Anatoli (обсудить/вклад) 08:14, 16 April 2014 (UTC)[reply]
It would refer to standard Japanese, although I don't know where that information should be placed. How do the accent numbers correspond to accent types 'h,o,a,n'? Japanese pitch accent isn't very helpful. Do they refer to the accented morae? (in which case, nakadaka could get more than one number, correct?) Wyang (talk) 13:10, 16 April 2014 (UTC)[reply]
Letters must be the best way, then. Since we don't know if a given transcription is for the standard Japanese, I'll drop this request as well. They can be marked in brackets for any variety, potentially. Do you think we should have a default IPA info, when the pitch is unknown? Unfortunately, I don't have enough sources for the Japanese accents, nothing online, only some old Japanese-Russian dictionaries with accents. --Anatoli (обсудить/вклад) 22:02, 16 April 2014 (UTC)[reply]


  • Looks good. I tweaked the module to replace ɽ with ɾ̠. The former is a retroflex tap, not used in Japanese, while the latter is more generally accepted as the /r/ tap preceding an /a/ sound.
@Atitarev you're talking about the numbers used in some dictionaries to indicate the number of the syllable right after which the downstep in pitch occurs, yes? Or something else? If you mean the downstep syllable, calling it accent isn't quite correct. If you mean something more like dialect, maybe some other less ambiguous term could be used.
@Atitarev, Wyang {{ja-accent-common}} refers to 標準語 as defined by Tokyo standard pronunciation and the NHK pronunciation guidelines for broadcasters. I haven't seen any resources that give pitch information for any other dialects, but I would be quite happy to include those, provided we can find such resources.
With that in mind, is there any easy way to make the module, um, modular :), to allow for pluggable pitch sub-modules or functions? I haven't gone through the code at all really, I just made that one change to swap out ɽ.
Also, @Wyang the pitch drop on long vowels looks plug-ugly at カリフラワー, at least on my machines -- the ` invalid IPA characters (`) that's supposed to show the lower pitch shows up too far to the right, so that it's not even over the vowel at all, appearing instead as a stray mark over the closing square bracket.
Lastly, combos like 'çʲj' don't quite work -- this should just be 'çj' instead, with just the main palatal glide of /j/.
Cheers folks, thank you for your help with this! ‑‑ Eiríkr Útlendi │ Tala við mig 22:03, 16 April 2014 (UTC)[reply]
  • Oh, and to clarify, when I say I haven't seen any resources that give pitch information for any other dialects, I mean on a word-by-word basis. Shibatani and others do discuss the broad trends of pitch patterns, but the only lexicographical information about pitch that I've seen in actual dictionaries and the like has been for Tokyo dialect. ‑‑ Eiríkr Útlendi │ Tala við mig 22:05, 16 April 2014 (UTC)[reply]
  • Oo, also, じ should be rendered as d͡ʑi, not (d͡)ʑʲi (not really clear what the parens are doing there, and no need for the small "j"). C.f. 御御籤.
For compatibility (and legibility) purposes, {{ja-pron}} should support yomi as a synonym param for y.
And, how / where do we put the reference footnote? If I put it right after the call to {{ja-pron}}, the footnote shows up on the IPA line -- which isn't correct, since I'm using the reference for the pitch accent, not the IPA. See 御御籤 again for an example.
Thank you again! ‑‑ Eiríkr Útlendi │ Tala við mig 22:27, 16 April 2014 (UTC)[reply]
@Wyang Re: "nakadaka could get more than one number, correct". Does that mean than "n" may not be sufficient? I'll dig up my old dictionaries/textbooks and check if there is a straightforward mapping between numbers and letters (pitch accent names). --Anatoli (обсудить/вклад) 22:32, 16 April 2014 (UTC)[reply]
Pitch and names and numbers:
  • Heiban (平板, “flat”): Pitch rises after first syllable, falls gradually thereafter. Pitch number 0 -- no downstep.
  • Atamadaka (頭高, “high head”): First syllable takes high pitch, downstep immediately thereafter. Pitch number 1 -- downstep after first mora.
  • Nakadaka (中高, “high middle”): Pitch rises after first syllable, downstep after some number of morae. Can only apply to terms with at least 3 morae. Pitch number varies, must be at least 2, and less than the total number of morae in the term -- downstep after mora indicated by number.
  • Odaka (尾高, “high tail”): Pitch rises after first syllable, downstep after last mora. Pitch number varies, must be the number of the last mora in the term -- downstep after mora indicated by number. For odaka terms, the downstep is actually heard on the following particle.
Hope that clarifies! ‑‑ Eiríkr Útlendi │ Tala við mig 23:45, 16 April 2014 (UTC)[reply]
Thanks. You have just confirmed that for Nakadaka there are variants. That means that just using "n" won't produce 100% correct pitch accent. Users may want to know when the downstep starts on long words, right? Please clarify. --Anatoli (обсудить/вклад) 00:08, 17 April 2014 (UTC)[reply]

Thanks.

  1. 'replace ɽ with ɾ̠' - great
  2. 'the accent fall on long vowels looks plug-ugly'
    Um... not sure how to solve this. Unicode only has combined grave-macron for e (ḕ) and o (ṑ). It is caused by the font formatting <tt>kàrífúráwā̀</tt> (kàrífúráwā̀). cf. normal unformatted 'kàrífúráwā̀'. We could decompose it into single vowels, eg. kàrífúráwàà, or use either no formatting or some other font (which I don't know). For now I have decomposed it into single vowels and it now looks like Navajo.
  3. Palatalisation. It is currently consistently marked. Questions are:
    1. Should it be consistently marked? eg. mi -> mʲi, ki -> kʲi, çj -> çʲj. I have removed this now.
    2. Should it be marked by default after ɕ, t͡ɕ and d͡ʑ, or when those are followed by non-'ij', or not at all? This version of 統合失調症 used the second option. I have changed it to match that.
  4. About '(d͡)ʑ'... Japanese phonology says d͡ʑ ~ ʑ and d͡z ~ z are in free variation for romaji 'j' and 'z', respectively. Hence the notation there... Should these be written as 'd͡ʑ' and 'd͡z' regardless of the environments they are in? I have converted them to 'd͡ʑ' and 'd͡z' for now.
  5. I have added |yomi, |accent, |accent2, |acc_loc, |accent_loc (these are 'Tokyo' by default), |acc_ref, |accent_ref ('DJR' by default), |acc2_loc, |accent2_loc ('Tokyo' by default), |acc2_ref, |accent2_ref ('DJR' if |acc2 exists).

Anatoli: The single-letter accent types in that template mainly match {{ja-accent-common}}, except 'nakadaka' which needs further specifying. Thus |acc=o (o), |acc=a (a), |acc=h (h), |acc=2,3 (n), |acc=2,2 (n). I'm not sure how the accent numbers correspond to this. Maybe they are positions of accented morae? Wyang (talk) 00:20, 17 April 2014 (UTC)[reply]

  • @Wyang re: single-letter accent types, see above about Pitch and names and numbers.  :) Theoretically, it should be possible to specify h, a, or o without needing any number. Only nakadaka would require a number to be able to figure out where the downstep happens. As such, one should ideally be able to specify nakadaka using 'nX, where X is the number, or by using X alone.
For that matter, it might make sense to allow accent types to also be specified by number alone, where 0 or 1 would be heiban or atamadaka respectively, and any greater value would wind up as odaka or nakadaka, depending on how many morae are in the term.
  • Re: d͡ʑ ~ ʑ and d͡z ~ z, d͡z happens, but is rarer. Likewise, ʑ happens, but is rarer. This is mostly an issue of geographical variations in dialect. For NHK purposes (i.e. one of the closest things to a standard pronunciation), my understanding is that romaji "j" == [d͡ʑ], and romaji "z" == [z]. This gets complicated, but it might make sense in the longer term to add a param to allow for specifying this variation, since I think it might sometimes be contrastive and / or emphasized in certain careful speech.
Similarly, whether or not certain /i/ or /u/ sounds are unvoiced should also be specifiable. is つき in hiragana, and is usually [t͡sɯ̥ᵝki] as I hear it. Meanwhile, 付き as in about, regarding is also つき in hiragana, and is usually [t͡sɯᵝki] as I hear it. So it's not really possible to tell just from the kana spelling whether a given /i/ or /u/ is unvoiced.
How is non-initial /g/ handled? Are both g/ŋ produced? Please demonstrate on ありがとう.
To me, it seems most Japanese who start learning foreign languages late, have difficulty pronouncing /ʑ/, even if they make an effort. :) --Anatoli (обсудить/вклад) 01:11, 17 April 2014 (UTC)[reply]

Thanks.

  • For arigatou:
  • IPA(key): [a̠ɾʲiɡa̠to̞ː]
  • d͡z: changed to 'z'.
  • For vowel devoicing: There was a rule in the module, which devoices vowels between voiceless consonants, and then only keeps the first when two devoiced vowels occur in adjacent morae. I have removed that rule and added a |dev= parameter. Please see Template:ja-pron/documentation.
  • I have added |acc_note, |accent_note, |acc2_note, |accent2_note, which are placed at the end of the accent line.
  • Accent types '0' and '1' treated as 'h' and 'a'. If not single characters 'hao01', then remove 'o'. If resulting string is equal to the length of text, then 'o'. If not, then 'n'. eg. |acc=0 (h), |acc=1 (a), |acc=h (h), |acc=2 for 2-morae word (o), |acc=3 for 3-morae word (n3), |acc=o for 5-morae word (o).
  • I have added an accent reference template so that |acc_ref=NHK etc. can now call the reference template. (Template:ja-pron/documentation)

How about now? Wyang (talk) 02:10, 17 April 2014 (UTC)[reply]

More thoughts :) --
  • There are sometimes more than just two pitch accent patterns. The most I can recall running into is three, but I suppose it's possible that a handful of terms might even have four.
  • I'm changing the description for the dev param -- the number should really be described as the mora number, as some syllabic analyses would give incorrect results. For instance, かんした could be analyzed as having two syllables (sounding like /kan.ɕta/ in casual speech), but four morae, and the devoiced mora is the third one.
  • For references, I think it's best to have the default be nothing. There are terms where Daijirin doesn't include any pitch accent, and I've misplaced my NHK pronunciation dictionary (probably in a box in storage), but I work with native speakers and sometimes crib from them. In these cases, I deliberately don't list any reference, since there isn't really any -- but I think the pitch information is important enough to include, until such time as I can find a real reference to add.
Also, by You can also do it the conventional way, |acc_ref=[1] -- do you mean that it's possible to add the call to {{R:Daijisen}}, etc., directly as the acc_ref param value?
Thanks again, again! :D ‑‑ Eiríkr Útlendi │ Tala við mig 19:23, 17 April 2014 (UTC)[reply]


  • I have added |acc3, |acc4 and |acc5, since there might be occasions where non-Tokyo accent patterns would like to be specified too.
  • 'dev': I think I might have described it inaccurately... As the module analyses it, the 'dev' parameter is the position of the devoiced syllable in the kana string. eg. hyakushou should have |dev=3 (not 2), and だいこん やくしゃ should have |dev=6 (spaces are not counted). This is inconsistent with the format of the accent parameter, but I think it is easier to specify and easier for the module to handle.
  • 'ref': Oops, I forgot to put nowiki tags around it. It should read |acc_ref=<ref name="NHK">{{R:NHK Hatsuon}}</ref>.
  • 'default ref': I have removed the default reference, so that there is no reference listed when the parameter is unspecified.
  • 'dehijacking the talkpage': I agree... Hence it is here now.

Thanks. Wyang (talk) 22:01, 17 April 2014 (UTC)[reply]

References[edit]


Long vowel oddity spotted at パーソナルコンピューター[edit]

Just created this entry, and noticed that the ピュー got romanized oddly in the romaji-with-tone-marks bit in the pronunciation section:

[pàásónárúkóńpyúーtàà]

I'm about to log off for the night. If you have time, could someone look at the module and see what's going on there?

Cheers, ‑‑ Eiríkr Útlendi │ Tala við mig 04:53, 20 April 2014 (UTC)[reply]

Sorry for the delay. Fixed now. Wyang (talk) 08:02, 22 April 2014 (UTC)[reply]

Pitch on moraic /n/ > ん[edit]

I was reformatting 日本 and noticed that the downstep that occurs on the final ん isn't being indicated in the romanized version with tone marks. For instance, {{ja-pron}} is giving [nìhón] and [nìppón], when it should be outputting [nìhóǹ] and [nìppóǹ] instead.

For that matter, even if there were no downstep, the template should still show tone marks for moraic /n/. Could that be fixed? ‑‑ Eiríkr Útlendi │ Tala við mig 05:59, 21 April 2014 (UTC)[reply]

Thanks, I think it's fixed now. Wyang (talk) 08:02, 22 April 2014 (UTC)[reply]

Displaying numbers next to pitch accent names[edit]

@Wyang It would be good to add numbers next pitch accent names, similar to how some paper and online dictionaries mark accents, e.g. 現在 on Weblio. it would make it easier to cross-reference Wiktionary accent names to those numbers. User:Eirikr seems to agree. Do you think it's a good idea? --Anatoli (обсудить/вклад) 01:45, 20 June 2014 (UTC)[reply]

Added now. Wyang (talk) 02:22, 20 June 2014 (UTC)[reply]
Thank you. I've added [ ]. Is that OK? From what I've seen so far, either a superscript number is used or a number in square brackets. --Anatoli (обсудить/вклад) 03:01, 20 June 2014 (UTC)[reply]
Yes, please prettify anything. :) Wyang (talk) 03:58, 20 June 2014 (UTC)[reply]

宿題[edit]

@Wyang, @Eirikr

It didn't work on 宿題 (しゅくだい), the しゅ part. --Anatoli (обсудить/вклад) 02:19, 2 July 2014 (UTC)[reply]

Why is |dev=1? I thought vowel devoicing only occurs interconsonantally. Wyang (talk) 06:48, 2 July 2014 (UTC)[reply]
It's between two devoiced consonants ɕ and k. Same with しかし (working fine) and 少し (adding now). NHK even uses a similar notation to ours for devoiced vowels. --Anatoli (обсудить/вклад) 07:20, 2 July 2014 (UTC)[reply]
It should be |dev=2 instead. |dev= is the position of kana with devoiced vowel in the input kana string. Wyang (talk) 07:31, 2 July 2014 (UTC)[reply]
Oh, thank you. Silly me. :) --Anatoli (обсудить/вклад) 07:37, 2 July 2014 (UTC)[reply]
  • Just saw this thread again. Wyang, when kana compounds like しゅ are devoiced, the whole thing should be marked as devoiced, like しゅ. Marking it as し makes it look like [ɕiɯ̥ᵝ] or some such oddness, when what we want to indicate instead is [ɕɯ̥ᵝ] or [ɕʲɯ̥ᵝ].. ‑‑ Eiríkr Útlendi │ Tala við mig 01:40, 5 March 2015 (UTC)[reply]
@Eirikr I didn't notice this, thanks. You're right. I'm also inviting you to join Wiktionary:Beer_parlour/2015/February#Simplification_of_topic_categories_adding, which may affect Japanese categories, hopefully for the better, if implemented. --Anatoli T. (обсудить/вклад) 02:00, 5 March 2015 (UTC)[reply]

dev2, dev3?[edit]

@Wyang Frank, can there be more dev's, please, as in 蛋白質 to get [tã̠mpa̠kɯ̥ᵝɕit͡sɯ̥ᵝ], e.g. ...|dev=4|dev2=6...? --Anatoli T. (обсудить/вклад) 03:22, 28 January 2015 (UTC)[reply]

OK, second devil added. I want to rewrite its code... so that the devils can be written as たんぱ(く)し(つ), avoiding the need for |dev11=. Keep it like this for now, I will change the format if I ever get around to doing that... Wyang (talk) 03:43, 28 January 2015 (UTC)[reply]
Ah, thanks. I guess all uses will need to be updated? --Anatoli T. (обсудить/вклад) 03:50, 28 January 2015 (UTC)[reply]
That's not a big problem when done semi-automatically. We've managed to do all the Chinese format changes... Wyang (talk) 03:53, 28 January 2015 (UTC)[reply]
You're genius. :) --Anatoli T. (обсудить/вклад) 04:18, 28 January 2015 (UTC)[reply]
@Wyang Hi Frank, I'm back in Melbourne after three weeks in France (also a bit of Belgium) I'm eager to see the change, as 少し and しかし also need to be fixed. :) --Anatoli T. (обсудить/вклад) 01:20, 5 March 2015 (UTC)[reply]

Bug with sutegana at the beginning of a term[edit]

{{ja-pron|ふぁふぃとぅふぇふぉふぁ|acc=1}} {{ja-pron|ふぃとぅふぇふぉふぃ|acc=1}} {{ja-pron|とぅふぇふぉふぃ|acc=1}} "fúァ" and "fúィ" probably isn't desirable—umbreon126 07:37, 7 March 2015 (UTC)[reply]

'tis okay now —umbreon126 05:11, 27 March 2015 (UTC)[reply]
Thanks, User:Wyang! --Anatoli T. (обсудить/вклад) 05:19, 27 March 2015 (UTC)[reply]
No worries. :) Wyang (talk) 20:58, 27 March 2015 (UTC)[reply]

Delimiting vowels[edit]

For example, on 女王 (applies to the readings じょおう [2] and にょおう [2]), there is a need to delimit the vowels for the IPA to render properly, but when this is done using . as is done in ja-noun, the . is printed in the kana and it also messes up the accent because the dot gets counted as a kana. Nibiko (talk) 08:25, 4 June 2015 (UTC)[reply]

Fixed by Kc kennylau! Thank you so much, Kc kennylau! <3 Nibiko (talk) 04:36, 25 April 2016 (UTC)[reply]

Twofold long vowels[edit]

Different to my above-mentioned concern, I noticed that on 蓊鬱 おううつ is represented as òóótsú just before the section where it says Heiban. I would expect it to be òóútsú. Nibiko (talk) 03:12, 24 August 2015 (UTC)[reply]

[çiβa̠kɯ̥ᵝɕʲa̠][edit]

It gives the pronunciation [çiβa̠kɯ̥ᵝɕʲa̠] for 被爆者. I don’t know where this [β] comes from. The intervocalic /ɡ/ is often realized as a fricative but the phoneme /b/ doesn’t change at least in my pronunciation. In addition, [ɕʲ] is redundant because [ɕ] is already palatalized. — TAKASUGI Shinji (talk) 12:27, 9 September 2015 (UTC)[reply]

@Wyang. --Anatoli T. (обсудить/вклад) 00:03, 14 July 2016 (UTC)[reply]
Thanks. [β] is from Japanese phonology#Weakening - should the rule be removed? Removed palatalisation of [ɕʑ]. Wyang (talk) 00:11, 14 July 2016 (UTC)[reply]
I think we should remove the rule of β. — TAKASUGI Shinji (talk) 06:11, 14 July 2016 (UTC)[reply]
Ok no problem. Removed. Thanks! Wyang (talk) 09:01, 14 July 2016 (UTC)[reply]
Thank YOU! — TAKASUGI Shinji (talk) 00:53, 15 July 2016 (UTC)[reply]

Sorting[edit]

Can a sort parameter be added to the template? —britannic124 (talk) 16:28, 13 July 2016 (UTC)[reply]

  • Since this template already uses a kana-ized string as its primary input, a sort key shouldn't be necessary.
I do see that the underlying module is not changing katakana to hiragana for sorting purposes, but this should be fixed in the module itself, so that the sort key is correctly and automatically derived from the data that the module is already using. @Wyang, is that something you could do? If not, could you ping someone who could? ‑‑ Eiríkr Útlendi │Tala við mig 18:14, 13 July 2016 (UTC)[reply]
Sure, I added sortkeys to the IPA and audio categories. It's a bit of an ugly hack though, since those templates do not seem to support |sort=. Wyang (talk) 22:20, 13 July 2016 (UTC)[reply]

ん (‘n’) before approximates[edit]

Shouldn’t “n” before “w” be represented as [ɰ̃ᵝ], like in “denwa” [dẽ̞ɰ̃ᵝɰᵝa̠]? (Or least [dẽ̞ɴɰᵝa̠]?) And “n” before “y” as [j̃], like in “shin’ya” [ɕĩj̃ja̠]? —britannic124 (talk) 18:02, 2 September 2016 (UTC)[reply]

Recent change to Module:ja-pron[edit]

Hi @Eirikr! Just letting you know that there was a change to Module:ja-pron recently by User:Nardog. I'm not qualified to comment on the IPA changes, but I know you are definitely. :) Wyang (talk) 09:17, 17 May 2017 (UTC)[reply]

dev2[edit]

Since the Japanese attention category has no organisation, I'm leaving my concern here. It would be good if you could override the value of the dev parameter for a certain accent, as this would allow to express exceptions in a single use of the template. See 増幅器 and 屹度. Nibiko (talk) 13:02, 19 June 2017 (UTC)[reply]

Co-occurring pitch accents[edit]

ja-pron does not currently support co-occurring pitch accents. If a term has multiple pitch accents divided across words, then see 因果応報 and 一期一会 for the current way to format them. Nibiko (talk) 02:29, 29 June 2017 (UTC)[reply]

Error in display of devoiced vowels[edit]

{{ja-pron|だいこん やくしゃ|acc=5|acc_ref=DJR,NHK|dev=6|y=o}}

currently yields (refs removed)

[dàíkóń yáꜜkùshà] should be [dàíkóń yáꜜkùshà], to match the hiragana (and because apparently only u and i can be devoiced). Something must be making the function think there's an extra vowel or mora somewhere before the second word. — Eru·tuon 18:42, 20 August 2017 (UTC)[reply]

(I think that part of {{ja-pron}} is somehow generally problematic. (麻婆豆腐 [màbóódóꜜòfù], 野馬 [nóꜜòmà]) —suzukaze (tc) 09:14, 21 August 2017 (UTC))[reply]

Distinguishing [oɯ] from [oː] and [ei] from [eː][edit]

Though relatively rare, [oɯ] and [oː] and [ei] and [eː] do contrast in Japanese, as in ō ('king') vs. ou 追う ('to pursue'), mei-sha 名車 ('great car') vs. 目医者 me-i-sha ('eye doctor'). As far as the IPA is concerned this isn't much of a problem since e.g. {{ja-pron|o.u}} yields [o̞ɯ̟ᵝ], but this workaround wouldn't work as soon as |acc= is introduced. They should be supported in some way or another. Nardog (talk) 08:48, 28 August 2017 (UTC)[reply]

Or rather, maybe all instances of [oː] and [eː] should be represented by おー and えー instead of おう and えい, so that おう and えい would always stand for [oɯ] and [ei], as in せーしん ぶんれつ びょー instead of せいしん ぶんれつ びょう. This would make it clearer too because おう and えい being restricted to [oː] and [eː] is inherently ambiguous since, orthographically, they could always represent either [oː]/[eː] or [oɯ]/[ei]. Nardog (talk) 08:54, 28 August 2017 (UTC)[reply]

は/へ vs. わ/え[edit]

I also noticed the particles は and へ would still be shown as は and へ in the kana representation even though they are pronounced [ɰa] and [e], not [ha] and [he]. But I believe, since the kanas are in and of themselves phonetic symbols, in pronunciation illustrations, they should be the phonetic わ and え instead. (By extension one could argue を should be お too, but since there's no pronunciation variation in を and some do preserve [ɰo] for を so I don't support changing it as strongly as for the other two.) Nardog (talk) 11:31, 28 August 2017 (UTC)[reply]

IPA module[edit]

Hey @Wyang, is there a reason this template doesn't call format_IPA_full in Module:IPA? I ask primarily because this module is adding non-entry pages to Category:Japanese terms with IPA pronunciation. Thanks! —JohnC5

@JohnC5 If I remember correctly, it was because the IPA module does not allow sortkey categorisation. Wyang (talk) 05:26, 28 September 2017 (UTC)[reply]
@Wyang: How about now? ;PJohnC5
@JohnC5 All right! Changed. Wyang (talk) 05:45, 28 September 2017 (UTC)[reply]
@Wyang: Thanks! That fixed it. —JohnC5 06:01, 28 September 2017 (UTC)[reply]

日向[edit]

Discussion moved from User talk:Wyang#Japanese pronunciation oddity.

Hello Wyang, long time no write.  :)

I was cleaning up the 日向 entry, and found that the pronunciation given at 日向#Etymology_3 is a little weird. It's showing up as [çɨᵝːɡa̠], when it should be something more like [çjɯːɡa̠]. I don't suppose you could have a look at the module code? ‑‑ Eiríkr Útlendi │Tala við mig 05:47, 15 September 2017 (UTC)[reply]

(@Nardogsuzukaze (tc) 05:48, 15 September 2017 (UTC))[reply]
@Eirikr: It's perfectly accurate. See w:Japanese phonology. /u/ [ɯ̟ᵝ] becomes centralized to [ɨᵝ] after /j/, and /hj/ is [ç]. Vance (2008) had [çj], but this was criticized by Akamatsu. Nardog (talk) 05:55, 15 September 2017 (UTC)[reply]
Hello Eirikr! Long time no write. I will hand the mic now to ... Wyang (talk) 06:08, 15 September 2017 (UTC)[reply]
(after edit conflict... :) )
@Nardog:, there is a definite glide in the ひゅ sound as pronounced by speakers, which is not represented anywhere in [çɨᵝ]. The IPA is thus misleading.
Akamatsu seems to make the argument that there is no /j/ glide anywhere after /ç/, which is frankly baffling to me, as this does not match my experience at all. It leads me to wonder if he's describing a dialect, or if his home lect might be biasing his interpretation.
FWIW, I'm more interested in descriptively representing Japanese sounds, rather than hewing to any particular academic theory. ‑‑ Eiríkr Útlendi │Tala við mig 06:15, 15 September 2017 (UTC)[reply]
@Eirikr: [ç] is a palatal fricative, i.e. [j̝̊], so it is only natural a [j]-like sound is heard during the transition from [ç] to [ɨᵝ] (that is why they're called "glides" in the first place). Hence [çj] is inherently redundant, unless some language somewhere contrasted [çV] and [çjV]. Nardog (talk) 06:40, 15 September 2017 (UTC)[reply]
@Nardog: I'm familiar with palatal fricatives, but it seems I've spent too much time working on /phonemics/ and not [phonetics]. I'm happy to concede the point. ‑‑ Eiríkr Útlendi │Tala við mig 07:19, 15 September 2017 (UTC)[reply]
FWIW, it would also be very inconsistent if it were [çj]. All consonants are palatalized before /i, j/ either phonetically ([kʲ], [ɡʲ], [mʲ]...) or phonologically ([ɕ], [tɕ], [(d)ʑ]...). So if /h/ became [çj] before /i, j/, or, even more capriciously, /hi/ became [çi] but /hj/ became [çj], that would be quite an exception. Nardog (talk) 06:37, 29 September 2017 (UTC)[reply]

ざ・ず・ぜ・ぞ[edit]

Discussion moved from User talk:Wyang#More JA pronunciation questions.

I noticed that ざ・ず・ぜ・ぞ are now being rendered by the module with initial consonant [d͡z], indicating a harder onset than I hear around me. I also note that some dialects of Japanese distinguish between づ and ず, which would ostensibly be [d͡zʉ͍] and [zʉ͍].

Do you have any insight? ‑‑ Eiríkr Útlendi │Tala við mig 04:57, 28 September 2017 (UTC)[reply]

I felt we discussed at Template talk:ja-pron before when designing the template. Pinging @Nardog for his or her opinion. Wyang (talk) 05:20, 28 September 2017 (UTC)[reply]
It's in free variation, so we can't say for certain it's one or the other. The compromise the template currently adopts is to treat /z/, /zi–di/, /zu–du/, /zj–dj/ as affricates when word-initial or after /N/ and as sole fricatives when intervocalic.
Do the speakers around you at least pronounce it with the tip of the tongue at first in contact with the roof of the mouth? If so it's without a doubt an affricate, although it might not be as striking as cards in English. In fact most speakers of Standard Japanese can't (and don't realize they can't) pronounce English zoo or French genre properly without training, or grasp the difference between cars and cards.
Not only is the number of speakers who still make the distinction between /zu/ and /du/ very small (see the map at w:Yotsugana), but the distinction is not represented in orthography in many cases since the spelling reform of 1946. So we can't possibly integrate the pronunciation for speakers without the neutralization into the template. (Also note that even in words still spelled with ぢ or づ, speakers who have /zu, du/ and /zi, di/ neutralized might still pronounce them as [z]/[ʑ].) We can of course manually add non-neutralized pronunciations on the entries for relevant words, though.
See Labrune (2012:64–66) for more. Nardog (talk) 07:40, 28 September 2017 (UTC)[reply]
@Nardog: Re: yotsugana, thank you for the link, I couldn't think of the term earlier this evening. FWIW, one of my teachers years ago was from Kyushu and made the four-way distinction. Later, I lived in the Tōhoku, but in Morioka, which appears to be the pocket of yellow on the map; later on, I was in Tochigi, and later still in Tokyo. FWIW, I recall students in Tochigi deliberately overpronouncing づ in names to clarify the spelling, so at least in that rarified context, even Kantō speakers may make some distinction.
I'll keep my ears open at work over the next several days, and see if I can tease out the specifics of articulation by the native speakers around me (mostly Tokyo-ites, with some folks from Kyoto and elsewhere in Kansai). ‑‑ Eiríkr Útlendi │Tala við mig 16:19, 28 September 2017 (UTC)[reply]
ご報告楽しみにしてます。;)
Here's a relevant quote from Vance (2008:85–86):

Typically, though not consistently, [dz] occurs at the beginning of a word or in the middle of a word immediately following a syllable-final consonant [i.e. /N/ or /Q/], and [z] occurs in the middle of a word immediately following a vowel. In short, [dz] and [z] are allophones of this /z/ phoneme. Most native speakers of Japanese are quite surprised to discover that there's actually a phonetic difference to worry about, but you'll hear it if you listen carefully to pronunciations of zu [dzɯ] 図 'diagram' and chizu [cɕizɯ] 地図 'map'.

He then goes on to cite the minimal pairs (traditionally spelled くづ) vs. (traditionally くず) and 記事 (traditionally きじ) vs. 生地 (traditionally きぢ), which used to be pronounced differently "until about 400 years ago" but are now both spelled with ず/じ and not distinguished by Tokyo Japanese speakers.
Interestingly, he diverges a little bit from Labrune in saying that, in "careful pronunciation", modern Tokyo Japanese speakers always realize j (じ, じゃ, じゅ, じょ) as [dʑ], as opposed to the "typical, though not consistent," production of z (ざ, ず, ぜ, ぞ) as [z] intervocalically and as [dz] otherwise.
If this account is supported by several other scholars, I'd be willing to change the template's current realization of じ, じゃ, じゅ, じょ, i.e. [ʑ] intervocalically and [dʑ] otherwise, to always [dʑ]. Nardog (talk) 06:16, 29 September 2017 (UTC)[reply]
I support your proposition just as a native speaker. For me, /dʑ/ is the base phoneme and [ʑ] is a casual intervocalic allophone, just like the intervocalic allophone [ɣ] for the phoneme /ɡ/. — TAKASUGI Shinji (talk) 00:48, 10 March 2018 (UTC)[reply]

Verb ending with "ou"[edit]

I know that verbs ending with "ou" such as 競う, 囲う and 惑う are pronounced not /o:/ but /ou/. Naggy Nagumo (talk) 08:10, 8 December 2017 (UTC)[reply]

Sorry, I can distinguish them by writing like "まど.う". I am sorry for making noise. Naggy Nagumo (talk) 08:15, 8 December 2017 (UTC)[reply]

Katakana for IPA[edit]

Something's going wrong at 首長国: there's a katakana in the IPA transcription. —Mahāgaja (formerly Angr) · talk 14:52, 9 March 2018 (UTC)[reply]

And at ニュースキャスター too. —Mahāgaja (formerly Angr) · talk 14:55, 9 March 2018 (UTC)[reply]
ニュースキャスター has wrong parameter, I'll fix it. 首長国: When yōon like "しゅ" is devoiced, it seems go wrong. --Naggy Nagumo (talk) 23:18, 9 March 2018 (UTC)[reply]
(Notifying Eirikr, Wyang, TAKASUGI Shinji, Nibiko, Suzukaze-c, Dine2016, Poketalker, Cnilep, Britannic124, Fumiko Take, Dine2016): Anatoli T. (обсудить/вклад) 00:19, 10 March 2018 (UTC)[reply]
I think the entire module needs to be overhauled (;・∀・) —suzukaze (tc) 00:33, 10 March 2018 (UTC)[reply]
Just curious, why do you think so. It's mostly working fine, doesn't it? --Anatoli T. (обсудить/вклад) 00:51, 10 March 2018 (UTC)[reply]
It's working, but it seems fragile... —suzukaze (tc) 01:37, 10 March 2018 (UTC)[reply]
Re dev: I think it would be much easier if the dev parameters are incorporated into the kana string (e.g. つ'くよみ). Wyang (talk) 00:54, 10 March 2018 (UTC)[reply]
@Wyang, @Naggy Nagumo -- my sneaking hunch is that the handling of the devoicing parameter is screwy from the get-go. For reasons unknown to me (I haven't gone through the module codebase), dev uses a different count than acc. While acc is based on the actual mora count, dev appears to be based on the character count -- which will always diverge from the mora count for any term with yōon or other small non-moraic vowel kana (such as ファン or シェル, where the small ァ and ェ are not technically yōon as I've understood it). Since the string processing for dev is based on character count, it seems like the module can incorrectly split up phonographemes like ファ or しゅ, leaving the small kana dangling and unprocessed -- where it then appears in the final output string. ‑‑ Eiríkr Útlendi │Tala við mig 00:59, 10 March 2018 (UTC)[reply]
@Eirikr There's the |devm= parameter that counts by mora, added by Kenny before, which is one bug (out of two) less buggy. I've fixed 首長国. Wyang (talk) 02:39, 10 March 2018 (UTC)[reply]
@Wyang I think your changes have introduced a Lua error on 少し. Could you please take a look? —Internoob 06:41, 10 March 2018 (UTC)[reply]
@Internoob Thank you, it has been fixed. Wyang (talk) 06:50, 10 March 2018 (UTC)[reply]
By the way, is there a need to have the y/yomi parameter? It is already present in {{ja-kanjitab}}, and in a few cases it may depend on which spelling you choose as the main entry. --Dine2016 (talk) 14:10, 11 March 2018 (UTC)[reply]
{{ja-kanjitab}} describes the spelling, in which case which yomi is in use is potentially useful information. {{ja-pron}} describes the pronunciation, in which case, again, which yomi is in use is potentially useful information. Given the current infrastructure, if we want to have yomi in both places, we need to add the value in both places -- so far as I understand it, the scope is limited such that one template invocation on a page cannot reference any of the parameters given to another template invocation.
Not sure what you mean by "in a few cases it may depend on which spelling you choose as the main entry". The yomi in either {{ja-kanjitab}} or {{ja-pron}} should match the headword for the relevant etymology section. Any given spelling with multiple readings should have a separate etymology section for each reading. ‑‑ Eiríkr Útlendi │Tala við mig 21:13, 13 March 2018 (UTC)[reply]
"It may depend" might be referring to cases like 気まぐれ / 気紛れ, in which there is on'yomi/on'yomi+kun'yomi. —suzukaze (tc) 21:17, 13 March 2018 (UTC)[reply]
For cases like that, which don't cleanly fit even into 湯桶読み or 重箱読み categories, and for cases of longer mixed-reading compounds, I find myself coming back to the need to revamp {{ja-kanjitab}} (at a minimum) to allow editors to specify yomi for each kanji, not just for the whole term. (I mean, allow specifying for the whole term where that fits, but also allow per-kanji yomi values where whole-term reading categories won't fit.) In fact, thinking it through now, I'd prefer to have *detailed* yomi information in {{ja-kanjitab}}, and leave it out of {{ja-pron}}.
Is this idea sensible? Would that work for others? ‑‑ Eiríkr Útlendi │Tala við mig 21:26, 13 March 2018 (UTC)[reply]
There is Template_talk:ja-kanjitab#Feature_request:_jukujikun_readings and Module:User:Suzukaze-c/Hani-tab (although we are diverging from the original topic). —suzukaze (tc) 21:29, 13 March 2018 (UTC)[reply]

@Naggy Nagumo, Atitarev, Suzukaze-c, Wyang, Eirikr, Internoob, Dine2016: Now at the kanji itself is appearing in the IPA transcription, even though the template specifies the hiragana as the first positional parameter. —Mahāgaja (formerly Angr) · talk 14:48, 21 March 2018 (UTC)[reply]

I suspect that was caused by the second instance of {{ja-pron}}, {{ja-pron|a=もり.wav}}, which didn't specify any parameter except a= for the audio file. I've merged the two, and now things are displaying correctly. ‑‑ Eiríkr Útlendi │Tala við mig 19:05, 21 March 2018 (UTC)[reply]

Atamadaka notation not accounting for long vowels in first syllable[edit]

In creating the page for 聖句 (せいく), I gave {{ja-pron}} the |acc=1| parameter, since it uses an atamadaka-gata pitch accent. However, it is showing séꜜèkù for the pitch, when it should be sééꜜkù because of the long vowel in the first syllable. Should a rule be added in the module where it would place the pitch fall differently between, say, せー and せい in the first parameter? (This exception would also occur when the first syllable ends with ん.) BlueCaper (talk) 17:05, 22 June 2018 (UTC)[reply]

Each せい, せー and せん is single syllable but two morae. It should be séꜜèkù. --Naggy Nagumo (talk) 15:03, 6 July 2018 (UTC)[reply]
Yes, as Naggy stated. Atamadaka means high pitch on the first mora, not the first syllable, so séꜜèkù is correct. ‑‑ Eiríkr Útlendi │Tala við mig 20:09, 6 July 2018 (UTC)[reply]

Verb 囲う "kako.u" incorrect IPA[edit]

On the entry for 囲う, using "かこ.う" the kana correctly renders as "かこう" but the IPA says "kàkóó". The second "o" should be a "u". — This unsigned comment was added by Aogaeru4 (talkcontribs).

Agreed. (Notifying Eirikr, Wyang, TAKASUGI Shinji, Nibiko, Suzukaze-c, Dine2016, Poketalker, Cnilep, Britannic124, Fumiko Take, Nardog, Marlin Setia1, AstroVulpes, Tsukuyone): . --Anatoli T. (обсудить/вклад) 03:11, 10 July 2018 (UTC)[reply]
Note that the verb (おも) (omou) with a similar ending is working fine. --Anatoli T. (обсудить/вклад) 03:16, 10 July 2018 (UTC)[reply]
It works correctly only when acc=2 is specified. — TAKASUGI Shinji (talk) 03:27, 10 July 2018 (UTC)[reply]
I think the significant feature is that there is no problem when the accent falls on the kana before the "u". (Aogaeru4 (talk) 03:30, 10 July 2018 (UTC))[reply]
It should be fixed now. Wyang (talk) 03:43, 10 July 2018 (UTC)[reply]
Yay! Thanks, Frank. --Anatoli T. (обсудить/вклад) 03:46, 10 July 2018 (UTC)[reply]

yutōyomi vs k,o[edit]

Discussion moved to Template talk:ja-kanjitab

呉音 and 漢音[edit]

See the Wikipedia articles Go-on and Kan-on. Also compare the following Google hits:

The second one mostly shows our project, which is a bad sign: we don’t follow the mainstream spellings. — TAKASUGI Shinji (talk) 11:31, 20 May 2019 (UTC)[reply]

  • FWIW, I note that Google doesn't differentiate between searches for [ kan-on] and searches for [ kan'on]. Regarding the convention of romanizing 呉音 as go-on with a hyphen, I think that's largely to differentiate from the regular English term goon. I see that the WP article at Go-on has had the same headword spelling since 2008, and no discussion on its talk page about the romanization.
By way of comparison, I just had a look in the index of Shibatani's The Languages of Japan, and I see that he used kan'on and go'on.
Considering that go-on and go'on appear to be more common in English-language texts as the romanization for 呉音, I think we should have entries at these spellings, in accordance with our overarching policy of being a descriptive dictionary. If a term is in demonstrable use, we should generally have an entry for it. This also improves discoverability. However, for purposes of our own usage in headings and text in Japanese entries, I don't consider Google usage patterns to be a strong indicator for how we should spell things: for Japanese terms written in the Latin alphabet, we use a variant of Hepburn, as described at Wiktionary:Japanese transliteration.
That said, it's good to discuss this from time to time and make sure we're in accordance at least amongst ourselves. ‑‑ Eiríkr Útlendi │Tala við mig 17:46, 23 May 2019 (UTC)[reply]

Strange behavior at ハノイ[edit]

For some reason, the pronunciation at ハノイ is given as "[háꜜnòì]" in transcription (so far quite reasonable), but as "[ɰᵝa̠ no̞i]" in IPA, which seems to be "wanoi" rather than the "hanoi" I'd expect. What's happening here? MuDavid (talk) 08:11, 23 May 2019 (UTC)[reply]

@MuDavid: Moved the discussion from Module talk:ja. Removing the space from the word fixes the transcription: {{ja-pron|ハノイ|acc=1|acc_ref=NHK}} gives [ha̠no̞i]. Perhaps on its own is being interpreted as the particle . — Eru·tuon 08:57, 23 May 2019 (UTC)[reply]
Indeed it does, thank you! MuDavid (talk) 09:05, 23 May 2019 (UTC)[reply]

char count or mora count for "dev"?[edit]

Since I changed the module, "dev" should be an index by mora, not by character. Previously dev was counted by character, so entries including yōon(拗音) will be incorrect. (e.g. 百姓 {{ja-pron|ひゃくしょう|dev=3}} will be incorrect ひゃく(しょ)ー, now it should be dev=2). If this change is bad, please revert or fix the module. If this change is okay, please fix entries. --Naggy Nagumo (talk) 00:29, 7 March 2020 (UTC)[reply]

At most 182 entries may be related [1][2]Naggy Nagumo (talk) 00:37, 7 March 2020 (UTC)[reply]

@Naggy Nagumo: Thank you! I'd long been puzzled by that. At some point there was a separate devm parameter to handle devoicing specifically by mora count -- but I've never understood why that was even created, since (as you note) one would never want to mark devoicing on any other basis. This might entail some clean-up, though, for entries with 拗音 (and possibly also 促音). Cheers! ‑‑ Eiríkr Útlendi │Tala við mig 00:44, 7 March 2020 (UTC)[reply]
@Mahagaja Fixed. —Naggy Nagumo (talk) 05:55, 3 July 2020 (UTC)[reply]

Deprecate yomi=/y= for this template and Kansai/Kyoto pitch accents[edit]

Since {{ja-kanjitab}} is enough for the yomi= parameter, might as well remove the yomi=/y= parameter of this template? That is of course if there is a bot who can gladly make the necessary changes.

In other news, I have been experimenting with ああ and 明日 regarding the Kansai/Kyoto accents after @Kyoww's edits and tried to unify it into one single template with the standard Tokyo accent, which proves to be disastrous:

In this example, the Kyoto accent should be Kōki instead of Atamadaka and parameters prefixed with acc3= were put in the first line or acc=/acc1=.


This one has two separate templates, better than nothing, the Kyoto accent still has to be labeled Kōki instead of Atamadaka.


This is the current one with the correct Kyoto pitch accent and not using the template as in the pronunciation under the Okinawan header. The IPA is somewhat the same as the last two though.

Any thoughts? Is there to need to update this template for the dialects? Remember that the pitch accents are not in line as the standard Tokyo accents. ~ POKéTalker05:35, 7 March 2020 (UTC)[reply]

@Poketalker: For the yomi or y parameters, it's probably enough to just remove or comment out the relevant code in the module. I'm less familiar with our Lua infrastructure, but for regular templates that don't use modules, any parameter in a template call that isn't handled by the template does ... nothing. It only appears in the wikicode, with no effect on the output.
If we wanted to be clever about it, we'd replace the handling code for yomi or y values to instead add the entry to a maintenance category, which we would then use to either manually or bot-wise go through and remove the parameters and arguments.
For the pronunciation details, I have some questions.
  • Is that specifically Kyōto accent, or is it more generally Kansai?
  • What is Kōki? Since this is supposed to be information for the reader of the template's output, in order to provide further detail required to understand that output, I'd recommend that we create that entry, and any other relevant entries, before building out their use in this template.
Good stuff! Cheers, ‑‑ Eiríkr Útlendi │Tala við mig 05:58, 7 March 2020 (UTC)[reply]
@Eirikr: after checking Kansai_dialect#Pitch_accent link (which you might have not seen in my starting post), also forgot to put Japanese_pitch_accent#Kyoto–Osaka_(Keihan_type) for your convenience. You're right that the accent mentioned is Kansai *in general*, don't see any difference there. 高起 (kōki) literally means a “high-rising” accent in which the first mora has a high pitch and the next one(s) usually have high pitch(es). Contrasts to 低起 (teiki), literally “low-rising” where the first mora has a low pitch. Both yield no entries in Kotobank, though. ~ POKéTalker06:55, 7 March 2020 (UTC)[reply]
@Eirikr, @Poketalker: I wrote a bot script today to get rid of the yomi arguments from the {{ja-pron}} from entries in {{tracking/ja-pron/yomi}}. The source code is here, if you would be interested in reading it. I'm going to make a post in the Beer parlour – if you think the script is good enough, please lend me your support over there! Thanks, Kiril kovachev (talk) 15:25, 29 May 2023 (UTC)[reply]

References[edit]

  1. 1.0 1.1 1.2 Nakai, Yukihiko, editor (2002), 京阪系アクセント辞典 [A Dictionary of Tone on Words of the Keihan-type Dialects] (in Japanese), Tōkyō: Bensei, →ISBN
  2. 2.0 2.1 2.2 2.3 2.4 2.5 NHK Broadcasting Culture Research Institute, editor (1998), NHK日本語発音アクセント辞典 [NHK Japanese Pronunciation Accent Dictionary] (in Japanese), Tōkyō: NHK Publishing, →ISBN
  3. 3.0 3.1 3.2 Matsumura, Akira, editor (2006), 大辞林 (in Japanese), Third edition, Tōkyō: Sanseidō, →ISBN
  4. 4.0 4.1 4.2 Kindaichi, Kyōsuke et al., editors (1997), 新明解国語辞典 (in Japanese), Fifth edition, Tōkyō: Sanseidō, →ISBN

Odd output[edit]

I noticed at 石榴 (zakuro) that {{ja-pron}} is outputting initial /z-/ instead as [d͡z-]. I don't think this is correct; while that might be an allophone, I don't think it's the main form.

Meanwhile, at 砕氷船 (saihyōsen), I noticed that {{ja-pron}} is outputting /hjo/ as [ço̞], completely missing the palatal glide.

Does anyone have any insight into what's going on here? I'm pretty sure these both represent changes that have crept in some time over the past few months while I've been busy off-site. ‑‑ Eiríkr Útlendi │Tala við mig 06:02, 25 May 2020 (UTC)[reply]

Utterance-initial /z/ is typically [dz]. See Vance (2008: 85), Labrune (2012: 64), Maekawa (2010). We talked about [ç] vs [çj] in #日向 above. As I said then, a palatal glide articulation is implied because the tongue inevitably passes through that position during the transition from a voiceless palatal fricative to the following vowel. /C/ before /i/ and /Cj/ are consistently represented as a single palatal(ized) consonant ([kʲ ɡʲ ɕ ʑ t͡ɕ d͡ʑ ɲ bʲ pʲ mʲ ɾʲ]), so it makes no sense to single out /h/ before /i/ and /hj/ and represent them as two segments. Nardog (talk) 13:04, 11 February 2022 (UTC)[reply]

Katakana long mark[edit]

@Eirikr, Erutuon, Naggy Nagumo, Nardog, Suzukaze-c, TAKASUGI Shinji: the katakana long mark is causing errors, for example at キャッシュディスペンサー and several others in Category:IPA pronunciations with invalid IPA characters. —Mahāgaja · talk 18:59, 4 July 2020 (UTC)[reply]

"dev" index on the pages was invalid. —Naggy Nagumo (talk) 23:16, 4 July 2020 (UTC)[reply]

Hyphens[edit]

Do we want hyphens?

Currently:

{{ja-pron|あい-うえお|acc=3}}い-うえお
instead of い-うえお

Suzukaze-c (talk) 21:38, 19 September 2020 (UTC)[reply]

Katakana for IPA again[edit]

(Notifying Eirikr, TAKASUGI Shinji, Atitarev, Fish bowl, Poketalker, Cnilep, Marlin Setia1, Huhu9001, 荒巻モロゾフ, 片割れ靴下, Onionbar, Shen233, Alves9, Cpt.Guapo): The template is not parsing しゅ correctly; it's putting a stray "ュ" into the IPA. See アラブ首長国連邦 for an example, though there are many others at CAT:IPA pronunciations with invalid IPA characters. —Mahāgaja · talk 20:41, 28 January 2022 (UTC)[reply]

It may have something to do with this edit by @Rdoegcd. —Mahāgaja · talk 20:43, 28 January 2022 (UTC)[reply]

Hi. I undid my edit. I was trying to fix an issue on 可視光線 where the devoicing diacritic was being added to the space between the words in the IPA, but I think my edit introduced another bug. --Rdoegcd (talk) 00:39, 29 January 2022 (UTC)[reply]
Fixed the above, I think. However, the module still produces an unexpected result if for some reason you try to devoice っ, ん, or a mora before ん or if you enter characters besides kana, ー, spaces, and periods. Rdoegcd (talk) 18:53, 30 January 2022 (UTC)[reply]

"Devoiced /a/"?[edit]

@Rdoegcd --

Re: diff and your edit comment, "fix bug where devoiced /a/ yields ḁ instead of a in romaji, try mw.log(p.accent('かきくけこ',0,'1,2,3,4,5'))".

Where is there ever a "devoiced /a/" in Japanese? In all I've read so far, Japanese only devoices /i/ and /u/, and only in specific situations. I've never heard (ha!) of any cases of devoiced /a/, /o/, or /e/.

Curious, ‑‑ Eiríkr Útlendi │Tala við mig 09:34, 25 March 2022 (UTC)[reply]

I saw it on , where it says
Lua error in Module:ja-pron at line 77: Parameter "y" is not used by this template.
I have no idea if it is correct. Japanese_phonology#Devoicing also mentions that /o, a/ may be devoiced sometimes. Rdoegcd (talk) 16:02, 25 March 2022 (UTC)[reply]
Not sure if it's appropriate to use this template for dialects other than standard/Tokyo Japanese. Most of them don't have the same two-accent system and the narrow transcription the module produces is bound to be way off. Nardog (talk) 11:59, 4 April 2022 (UTC)[reply]
Looks like Poketalker (talkcontribs) added that pronunciation information in December 2018. Poketalker, do you have a reference for that, or more generally any further detail on Kagoshima pronunciation patterns and whether {{ja-pron}} really fits for this? I know that some dialects aren't moraic and are syllabic instead, which wouldn't fit {{ja-pron}} in its current implementation. ‑‑ Eiríkr Útlendi │Tala við mig 19:27, 4 April 2022 (UTC)[reply]
Re: Japanese_phonology#Devoicing and the examples for /o/ and /a/, in Tokyo speech, that only happens in fast speech -- in which case, {{ja-pron}} is not the place to document that. Fast speech is a different beast, and I'm not comfortable at present trying to document that anywhere except maybe usage notes. — This unsigned comment was added by Eirikr (talkcontribs) at 19:29, 4 April 2022 (UTC).[reply]
@Eirikr: that was a blind insertion from a StarLing entry years ago. There was no accent on the ta there, so had to improvise. You could check Martin's book (JLTT) as referenced, but my impression at that time was no way to de-accent that mora. Could try to look at a Japanese etymological database, takes a while. ~ POKéTalker00:25, 5 April 2022 (UTC)[reply]
@Poketalker: Thank you for the background. FWIW, I am doubtful of Starling's derivation -- Japanese cartographic traditions put the north on the left, similar to one possible derivation for English north, and not behind as Starling contends. Also, I thought I recalled Shibatani suggesting that Kansai / Kagoshima pitch accent patterns were actually the innovation, and that Kantō was more conservative, which would put a further kink in Starling's theory... I'll have to look through my local copy. Cheers! ‑‑ Eiríkr Útlendi │Tala við mig 04:57, 12 April 2022 (UTC)[reply]

Please add SMK7 (Template:R:Shinmeikai7)[edit]

Could somebody who have a right for editing Module:ja-pron please add ['SMK7'] = 'R:Shinmeikai7', in ref_template_name_data. Lugria (talk) 00:03, 16 September 2022 (UTC)[reply]

I got the right, so I added it. Lugria (talk) 13:02, 17 September 2022 (UTC)[reply]

[ŋ][edit]

(Notifying Eirikr, TAKASUGI Shinji, Atitarev, Fish bowl, Poketalker, Cnilep, Marlin Setia1, Huhu9001, 荒巻モロゾフ, 片割れ靴下, Onionbar, Shen233, Alves9, Cpt.Guapo, Sartma, Lugria):

I think it would be informative and useful if we modified {{ja-pron}} so that it produces [ŋ] where applicable. Given that the template outputs phonetic, not phonemic IPA, it is actually in a way "necessary" for us to use [ŋ] where applicable if we want to be correct. In the article , we use [ŋ] manually (i.e. using {{IPA}} directly) but almost none of the other entries (where it would be correct) use both [g] as well as [ŋ] (there's also entries like 天が下 (before I've changed it) where only [ŋ] was used). I could easily modify the module so that it produces both a [g] as well as a [ŋ] variant if given か゚ but then the task remains that か゚ needs to be added to potentially thousands of entries. Is there a better solution? What do others think? — Fytcha T | L | C 00:29, 29 November 2022 (UTC)[reply]

Word-internal /ɡ/ as [ŋ] is a regressive feature, now consistently exhibited only by older Tokyo speakers.
  • Vance (2008: 214): "According to descriptions written around 1940, many Tokyo natives were consistent nasal speakers at that time, but the proportion of inconsistent speakers and consistent stop speakers was on the increase. More recent sociolinguistic studies indicate that the proportion of consistent stop speakers has in fact increased and that, at least in some Tokyo neighborhoods, the proportion of tokens pronounced with [ŋ]/[ŋʲ] correlates with the age of the speaker."
  • Labrune (2012: 78): "The study by Hibiya (1999) clearly demonstrates that there is a clear pattern of age stratification in the use of [ŋ], which drops off as age diminishes."
Nardog (talk) 17:44, 29 November 2022 (UTC)[reply]
@Fytcha: What @Nardog wrote is correct. Only a very small part of elderly people still uses /ŋ/, most people either say /g/ or /ɣ/ (the latter might even be more prominent in Tokyo). — Sartma 𒁾𒁉𒊭 𒌑𒊑𒀉𒁲 10:37, 1 December 2022 (UTC)[reply]
Slashes enclose phonemes. We're talking about realizations. Nardog (talk) 22:40, 1 December 2022 (UTC)[reply]
@Nardog: I meant realisations too, hence the verbs "use" and "say". Sometimes it's good to just be practical, see the big picture, and understand what people are trying to say, without being unnecessarily pedantic. — Sartma 𒁾𒁉𒊭 𒌑𒊑𒀉𒁲 23:16, 1 December 2022 (UTC)[reply]
I know you did, "we" was inclusive. When I see as grievous an error as using slashes to enclose allophones, I correct it not just for the person making the error but for anybody who sees it. Nardog (talk) 23:22, 1 December 2022 (UTC)[reply]
@Nardog: I understand that spirit. It's the same one that makes me call out pedantry when there is an excess of it. I don't do that just for the person being unnecessarily pedantic, but for anybody to know that it is ok to call it out. It's perfectly fine to point to the moon with a dirty finger. The enlightened ones will know what to look at. — Sartma 𒁾𒁉𒊭 𒌑𒊑𒀉𒁲 00:36, 2 December 2022 (UTC)[reply]

|acc2_note not working properly if there is no |acc_note[edit]

The note intended for the second accent is displayed after the first accent, see 千本. Pinging @Rdoegcd. — Fytcha T | L | C 02:11, 29 November 2022 (UTC)[reply]

@Fytcha Fixed. Rdoegcd (talk) 06:27, 29 November 2022 (UTC)[reply]

|acc_note not rendering if there is no pitch specified[edit]

There are cases where a word might not have any pitch that we can specify, but we still want to add acc_note to clarify -- particularly for verb suffixes, where the pitch depends on the verb to which the suffix attaches. See the wikicode at なば (naba), for instance.

Could someone please update the template / module to allow acc_note (and acc2_note, etc.) even if there is no acc (or acc2, etc.)? ‑‑ Eiríkr Útlendi │Tala við mig 21:20, 27 December 2022 (UTC)[reply]

d͡z/z and d͡ʑ/ʑ[edit]

The module currently produces [ɕĩn(d͡)ʑɨᵝ] for 真珠, which doesn't look right. I think it should be either [ɕĩɲ̟d͡ʑɨᵝ] (in the traditional description mentioned in w:Japanese phonology) or [ɕĩɰ̃ʑɨᵝ] (taking into account the Maekawa (2010) study and treating /N/ the same way as in 新種 [ɕĩɰ̃ɕɨᵝ]). Also, the d͡z/z and d͡ʑ/ʑ variations should hold before all vowels, not just before /i, u/, right? Rdoegcd (talk) 23:57, 1 April 2023 (UTC)[reply]

[ɕĩɰ̃ʑɨᵝ] wouldn't be "taking into account the Maekawa (2010) study". He says /N/ "strongly enhanced [rate of affricate realization]" (p. 365). Nardog (talk) 12:53, 2 April 2023 (UTC)[reply]
Removed it. The module already assumes the Tokyo-style yotsugana merger, which means [z] and [dz], and [ʑ] and [dʑ], are in complementary distribution. The addition seems to have been motivated by an incomplete understanding of the merger. Nardog (talk) 13:03, 2 April 2023 (UTC)[reply]

Add a hidden category for unsourced pitch accents[edit]

Hi,

Could someone add a hidden maintenance category when no reference is given for a pitch accent? This would allow me to check them and add references.

Thanks. Thibaut (talk) 11:11, 15 May 2023 (UTC)[reply]

@Thibaut120094 With regards to this, I regularly add pitch accents from JPDB.io because it's a very straightforward resource to access and means I don't need to open my NHK ebook and seek alphabetically for it; is there any other accepted online source that can be put in instead? Kiril kovachev (talk) 23:12, 28 May 2023 (UTC)[reply]
JPDB.io uses AI to autogenerate pitch accent graphs [3][4], it might not be accurate.
It’s best to stick to authoritative Japanese dictionaries like the NHK dictionary or the Daijirin, paid digital versions are available on various platforms (for example on iOS and macOS). Thibaut (talk) 23:53, 28 May 2023 (UTC)[reply]
I see, I apologise then if I may have polluted some entries with wrong pitch accents, although I believe the main ones I've been porting have been confirmed as opposed to AI-generated (there's a warning icon to signify if the accent is auto-generated). Anyway, this is perhaps a good wake-up call to simply make the effort to look it up, since I do already have a copy of NHK. Thanks for your advice, Kiril kovachev (talk) 13:55, 29 May 2023 (UTC)[reply]
Also, I found [5] for looking up words through NHK, this way you don't need a paper or ebook copy but can just look it up online. Kiril kovachev (talk) 22:27, 6 June 2023 (UTC)[reply]
@Thibaut120094 Here, I made it so they're now added to Special:WhatLinksHere/Template:tracking/ja-pron/unsourced_accent, but be warned there is a ton of them. Maybe there is a better way of doing this—in fact, I was wondering whether this should be a proper category, maybe which would appear under Category:Requests concerning Japanese, but unfortunately I don't know how to do that... Sorry that it took such a while when the change was really little in fact... Kiril kovachev (talk) 22:26, 6 June 2023 (UTC)[reply]
Thank you very much! Thibaut (talk) 08:32, 12 June 2023 (UTC)[reply]
No problem! Kiril kovachev (talk) 10:01, 25 June 2023 (UTC)[reply]

recent edit[edit]

(Notifying Eirikr, TAKASUGI Shinji, Atitarev, Fish bowl, Poketalker, Cnilep, Marlin Setia1, Huhu9001, 荒巻モロゾフ, 片割れ靴下, Onionbar, Shen233, Alves9, Cpt.Guapo, Sartma, Lugria, LittleWhole, Dixtosa, Kc kennylau, Rua, Ruakh, ZxxZxxZ, Erutuon, Jberkel, JohnC5, Benwing2, RichardW57, Chuck Entz, Metaknowledge, SemperBlotto, Dixtosa, Rua, Benwing2, Dixtosa, The Editor's Apprentice): the edit was meant to be minor edit; i was rushing and forgot to click the button Chuterix (talk) 04:37, 3 June 2023 (UTC)[reply]

also notifying @Theknightwho Chuterix (talk) 04:38, 3 June 2023 (UTC)[reply]
i fixed module error alos minor edit
can someone get shuri naha to work Chuterix (talk) 04:44, 3 June 2023 (UTC)[reply]
Okinawan has its own ISO 639 code, you should just fork the template/module rather than extend them. Nardog (talk) 07:23, 3 June 2023 (UTC)[reply]
@Nardog No - that is a really bad idea, because it causes a maintenance nightmare where modules for smaller sister languages effectively get neglected; we've had this issue with Okinawan already. We will do what we've done with the other Japanese modules, which is to adapt them to multiple languages. Honestly, I have no idea why you would suggest that. Theknightwho (talk) 17:41, 3 June 2023 (UTC)[reply]
Agreed with @Theknightwho. I tried to do that, but I don't know Lua and got reverted by @Chuck Entz. Chuterix (talk) 17:46, 3 June 2023 (UTC)[reply]
@Chuterix: I reverted you for two reasons: 1) There were errors that looked like they would affect all 42,000 transclusions and I don't know how to fix them- all I could do was revert to the state before the changes were made, and 2) Renaming "ja-pron" to just "pron" was a really bad idea. This isn't a general module that covers pronunciation in all languages, but one that covers specific writing systems. If someone ever does want to create a general module along those lines, the name would be already taken. It would be better to come up with a name that everyone can agree with before moving the module.
This wasn't a minor error: it broke all uses of the template (42,000 of them), and flooded CAT:E. Even after my revert fixed the errors, they were still flooding in there. I managed to purge a couple thousand entries before I fell asleep, but there were in excess of 7,000 more, and increasing by the minute. Meanwhile, there was a (probably) fixable module error in the Okinawan entry at that was not addressed because there was no way to tell it apart from the errors from your edits. I spotted it at the time, but never got around to telling @Theknightwho because I had this mess to deal with. Chuck Entz (talk) 18:20, 3 June 2023 (UTC)[reply]
BTW: there's an error at 欲しゃい that you need to fix (I think you used the wrong name in the module). Chuck Entz (talk) 18:26, 3 June 2023 (UTC)[reply]
@Chuck Entz I've fixed the error at : I merged Module:ryu-headword into Module:Jpan-headword (formerly Module:ja-headword, but now able to handle all Japonic languages). Module:ryu-headword had effectively been abandoned since it was forked back in 2019, and it looks like it was slightly more permissive with its parameters (i.e. it tried to guess which parameter was which based on the script, instead of just having fixed parameters). It's a feature that's gradually being phased out of East Asian templates, because it's a pointless waste of resources that causes lots of issues with edge cases where Latin letters are used in Jpan, Hani etc., such as Japanese NATO (NATO).
The error at 欲しゃい was caused by Chuterix forking Module:ja-see as Module:kzg-see, but without looking into the details I don't specifically know what the problem was. Much better to just wait until Module:ja-see has been converted into Module:Jpan-see so it can be done properly. Theknightwho (talk) 18:42, 3 June 2023 (UTC)[reply]

Removed yomi code[edit]

Dear all, as per my recent bot action, all instances of the yomi parameter we deprecated a few months ago have been removed, so I got rid of the code that originally handled transcluding pages into the tracking template, as well as the straggling data structures used to match the parameters to the reading names. Kiril kovachev (talk) 09:59, 25 June 2023 (UTC)[reply]

Custom accent[edit]

What about Kyoto, Kagoshima, Ryukyuan (for {{ryu-pron}}), possibly other accents?

I've documented the accent info here; there's no falling pitch accent, initial 2 morae accent, nor unaccent in more than two morae. I have suggested this in Discord, but both theknightwho and Fish bowl (i contacted in WT) are unable to do this ATM.

We can put this in a collapsible section if someone says "we should put the dialectial accent info somewhere else because we should only focus on "standard" (tokyo) pronunciation". We're trying to make this like {{zh-pron}} in the future and any contributions are highly appreciated.

A sample from (the dead account of) @荒巻モロゾフ can be viewed at User:荒巻モロゾフ/draft.

(Notifying Eirikr, TAKASUGI Shinji, Atitarev, Fish bowl, Poketalker, Cnilep, Marlin Setia1, Huhu9001, 荒巻モロゾフ, 片割れ靴下, Onionbar, Shen233, Alves9, Cpt.Guapo, Sartma, Lugria, LittleWhole, Mcph2): Chuterix (talk) 23:30, 1 January 2024 (UTC)[reply]

Add DJR4[edit]

Someone who has a right to edit Module:ja-pron, please add ['DJR4'] = 'R:Daijirin4', in ref_template_name_data. Lugriaルグリア [会話貢献] 06:21, 13 March 2024 (UTC)[reply]