Wiktionary:Beer parlour/2021/June: difference between revisions

From Wiktionary, the free dictionary
Jump to navigation Jump to search
Content deleted Content added
→‎{{tl|bor+}} and {{tl|inh+}}: Chuck Entz asked Victar to make his case - well, here is mine.
Line 204: Line 204:
:I agree, but this seems like a discussion for [[WT:RFDO]], not the Beer Parlor. —[[User:Mahagaja|Mahāgaja]] · [[User talk:Mahagaja|''talk'']] 19:07, 18 June 2021 (UTC)
:I agree, but this seems like a discussion for [[WT:RFDO]], not the Beer Parlor. —[[User:Mahagaja|Mahāgaja]] · [[User talk:Mahagaja|''talk'']] 19:07, 18 June 2021 (UTC)
:: [[Wiktionary:Requests_for_deletion/Others#Template:bor+_and_Template:inh+|RFD]] created as well. --<code>&#123;&#123;[[User:Victar|victar]]|[[User talk:Victar|talk]]&#125;&#125;</code> 20:36, 18 June 2021 (UTC)
:: [[Wiktionary:Requests_for_deletion/Others#Template:bor+_and_Template:inh+|RFD]] created as well. --<code>&#123;&#123;[[User:Victar|victar]]|[[User talk:Victar|talk]]&#125;&#125;</code> 20:36, 18 June 2021 (UTC)
::: I think lengthy case presentings shouldn't be part of RFDs, so I'm going to give mine here, where it [[WT:Beer parlour/‎2021/June#Pacifying User:The Nicodene|seems to be tolerated]].
::: These templates' function is to make editing easier for a small number of editors (by name, mostly, but probably not exclusively, SodhakSH, Inqilābī and Brutal Russian), and arguably create a regular wording for etymology sections (although I, as well as others, dispute that). The necessity of the text that is now being displayed using these templates is [[WT:Votes/2017-06/borrowing, borrowed|very much disputed]] to a degree that a supermajority (13 to 5) has voted to abolish giving the text within the {{tl|bor}} template. Of course, one could argue that adding the template {{tl|bor+}} isn't contradictory to that vote, but seeing as [[WT:Votes/2021-04/Creation of Template:inh+ and Template:bor+|the vote concerning adding bor+ also failed]] I wouldn't be so certain of that.
::: Some bring up the issue that AryamanA voted just past the time and thus failed to make the difference, but seeing as PUC was also going to vote oppose (mind you, an oppose vote is worth twice a support), the vote would have failed anyway.
::: Now, some have brought up that a new template's creation shouldn't need any vote, but I'd argue that since this template is one in a series of arguably most used templates (after {{tl|head}}, {{tl|l}} and {{tl|m}}), any creation of a template that takes over a part of or even the whole function of {{tl|bor}} or {{tl|inh}} should get a vote, which it did in this case, and would have even without the initiative of the template's advocates.
::: So, to reiterate, the proposal to create these templates was turned down in a democratic process that is of the highest form we have. An RFD discussion is of no value, since it's not as important as a vote, and as such I, and anyone who agrees with me, plead to the administrators of this project to delete these templates, lock them and either create another vote (which I personally would find absurd, since we just finished this one, and we are currently not in the season where the majority of Wiktionary editors is regularly editing), or just ban the creation of such templates until a supermajority of Wiktionary editors actually agrees that this template should be created. I thank you for your time.<small>{{ping|Victar|SodhakSH|Inqilābī|Brutal Russian|Mahagaja|Fenakhay|Imetsia|Benwing2|PUC|Lambiam|Andrew Sheedy|Bhagadatta|Chuck Entz}}</small> [[User:Thadh|Thadh]] ([[User talk:Thadh|talk]]) 11:09, 21 June 2021 (UTC)


== Categorization bot ==
== Categorization bot ==

Revision as of 11:09, 21 June 2021


Adding Sumerogram, Akkadogram and Determinative to standard POS

Hi! I've been working on Akkadian, Sumerian and Cuneiform Translingual entries recently. I've been using "Sumerogram", "Akkadogram" and "Determinative" as POS when needed, but I've been made aware those are not standard and could cause issues. (see 𒌉 for usage examples of Sumerograms, 𒀭 for Determinatives and 𒅆 for Akkadograms)

All three of them are necessary to structure Cuneiform entries for Akkadian and Sumerian (and Hittite, too) in a consistent way. Therefore, I'd like to propose adding them to the standard POS list. Do I have your vote? :D Sartma (talk) 09:08, 1 June 2021 (UTC)[reply]

Support. Tied to this, could "phonogram" be recognized as a header? See e.g. Old Korean .--Tibidibi (talk) 09:20, 1 June 2021 (UTC)[reply]
Per Fay Freak, revise vote to support "heterogram".--Tibidibi (talk) 16:51, 1 June 2021 (UTC)[reply]
On third thought, go back to supporting the original proposal. Oppose "logogram" because the category could lead to inconsistencies with other languages that use logograms, e.g. Japanese, and the existing POS setup should be preserved for those languages.--Tibidibi (talk) 17:49, 3 June 2021 (UTC)[reply]
@Tibidibi I was thinking last night that even if we end up choosing Logogram as POS, we would do so because it's the familiar term in Mesopotamian studies, in the same way Kanji is for Japanese, so it wouldn't really create any inconsistency with Japanese. Using Logogram for Akkadian/Hittite wouldn't necessarily mean we have to change POS for other languages. Sartma (talk) 18:05, 6 June 2021 (UTC)[reply]
Support. — Fenakhay (تكلم معاي · ما ساهمت) 15:41, 1 June 2021 (UTC)[reply]
uncertain - but could you at least add some actual definitions or translations to those entries. SemperBlotto (talk) 15:49, 1 June 2021 (UTC)[reply]
Hi! Sumerogram, Akkadogram and Determinative are categories of a cuneiform sign, in the same way Noun, Verb or Adjective are categories of a word. They classify the sign and under them we give a link to all the different words that can be written with that sign, so you will not find any "actual" definition or translation there. You find all relevant information in the page of the words listed under each category. If you take a second to check the pages I linked above you can see what I mean. Try clicking on a couple of the words linked as "Sumerogram of" or "Akkadogram of" under Sumerogram/Akkadogram and you'll be redirected to those words' entries. Sartma (talk) 16:55, 1 June 2021 (UTC)[reply]
@SemperBlotto: So the Akkadian and Sumerian word mentioned at مَيْس (mays) uses the sign 𒄑 (GIŠ) which wasn’t pronounced, when used as a determinative, but indicated to the reader that now the name of a tree follows (you might discern that the sign looks like a tree if you have a font for it installed). For this reason one might not parse the whole cuneiform string as a word so that one seeks a separate entry for 𒄑 (GIŠ), and any such signs, categorizating them as so-called determinatives, or taxograms or semagrams.
A heterogram is when you write mlkʾ, from the Aramaic spelling of the Semitic term for “king” *malk-, but actually mean and say شاه (šāh). They did such things frequently in the Ancient Near East. Fay Freak (talk) 17:16, 1 June 2021 (UTC)[reply]
Why not Heterogram? So one can use it later for Aramaeograms in Pahlavi etc. (@Victar) The definition line template {{sumerogram of}} already says “sumerogram”.
I guess Heterogram would work too, if you really are against Sumerogram/Akkadogram. It's just not a word you would find in Mesopotamian studies that much (I never saw it before now! XD), so it would be a bit confusing/alienating to people looking up Akkadian words. It says what it is, it just doesn't paint it a familiar colour. Like, when you study Akkadian you have glossaries and dictionaries with Akkadian words and then you have lists of "Sumerograms" (that unluckily never give the cuneiform, they're just like "A = water, A.BA = father, etc.) . I understand that Heterogram is more versatile, and I'm not against it in principle, but if there's no strong reason to change the labels, I would prefer to keep the more familiar ones. In the end, that's what they do in languages that use Han characters too (Japanese calling them Kanji, Korean Hanja, etc.). If we decide to go for Heterogram, then we should probably ask Japanese and Corean editors to change their entries accordingly too. Sartma (talk) 18:31, 1 June 2021 (UTC)[reply]
Actually, there's another word that's widely use in Mesopotamian studies: Logogram. That would be generic like Heterogram, including both Sumerograms and Akkadograms. What about Logogram? Again, if we choose a more general name, then for consistency we need to change also Kanji and Hanja, since they both are just Logograms.Sartma (talk) 10:56, 2 June 2021 (UTC)[reply]
@Sartma Two points:
  • There is a difference in that most people consulting Korean or Japanese entries are (hopefully) going to be casual learners, to whom "Hanja" and "Kanji" are the familiar terms, while most people consulting Akkadian entries will be people with at least some linguistics background who can be relied on to be more familiar with terms such as logogram, heterogram, etc.
@Tibidibi I don't think that we should base our decisions on the perceived or hypothetical readers of Wiktionary entries, arbitrarily discriminating by language (Japanese and Korean: ok; Akkadian: no, sorry): in other words, I'd like to be able to have a discussion based on facts and not personal feelings or perceptions. There will be a lot of casual learners of Akkadian and Sumerian consulting Wiktionary (judging by existing Akkadian and Sumerian entries, I can assure you that who wrote them was probably even more casually learning them than people using Wiktionary for Japanese and Korean...) to whom "Sumerogram", "Akkadogram" and "Determinative" are the most familiar terms (if not the only one they'll ever hear). I would like to write entries for the vastest possible public, but mainly for a public that's actually interested in Sumerian and Akkadian, not for general "people with some linguistics background". I'd like those entries to be useful to those who are studying those languages, not to "others" (what sense would it make to do otherwise?). Every language has its own "technical" terms. I'm not sure who we are pleasing by changing well established terms to favour others that would just make everything less clear, confusing and alienating. We don't do that with Latin, Ancient Greek or Sanscrit, were all traditional categories are maintained, whether they make "linguistically" sense or not. I'd like to see the same respect for Akkadian and Sumerian too. Sartma (talk) 15:58, 2 June 2021 (UTC)[reply]
@Sartma Okay, I take back the point about discriminating by langue. But we are not removing the "Sumerogram" and "Akkadogram" terms, and they are still displayed prominently in the page. They are still on the page due to {{sumerogram of}}, only the title of the header is "heterogram". So no information is lost, and if anything information is added; people will now know from the header that these are heterograms.--Tibidibi (talk) 16:10, 2 June 2021 (UTC)[reply]
  • Logogram is an extremely broad term while, to the best of my knowledge, full-scale heterogramic systems are more-or-less exclusive to the Ancient Near East, Japanese (modern and historical), and Old Korean; Chữ Nôm does not really use Chinese characters in this way. And since heterogramic entries are not made for Old Korean (there is no point because the phonetic component is not known) while Japanese has its own system already, it seems better to use "heterogram", which would become a more precise category exclusively used for extinct languages of the Near East. Modern Hanja are not heterograms.--Tibidibi (talk) 15:18, 2 June 2021 (UTC)[reply]
@Tibidibi: Logogram is a hyperonym of heterogram. The choice between the two should therefore take relevance and pertinence into account. Is it necessary to use "heterogram" instead of its hyperonym "logogram"? Does "heterogram" add any relevant/pertinent information that "logogram" doesn't express already? I'd argue that for the use in Akkadian entries "logogram" is sufficiently clear and there's no need to choose its hyponym "heterogram". The indication of a "foreign origin of the sign" is implicit in the further indication of the logogram as a Sumerogram or Akkadogram. Moreover, "Logogram" has the advantage of also being a very familiar word for people studying Akkadian and Sumerian: that to me is one big point in favour of its use. Sartma (talk) 16:29, 2 June 2021 (UTC)[reply]
I can concur; I know next to nothing of Sumerian and that stuff, but having "Logogram" as the header tells us enough; having each definition preceded by label "sumerogram of" and "akkadogram of" is clear, since it informs me of what to look for to know more about it. I actually understand what you're talking about here, which is sufficient for an entry. "Determinative" really should be a possible header, since it's used everywhere in the past, though the explanation you give reminded me more of jukujikun and the like. Knowing that the words make intuitive sense to those unfamiliar with the standard lingo, and having them agree with the accepted in-field jargon makes for this proposed system sufficing in my eyes. 110521sgl (talk) 14:31, 3 June 2021 (UTC)[reply]
  • I will also add here that the Korean hanja entries do not represent logograms (as in the glyphs themselves) but Sino-Korean morphemes, and most are closer to full lemmas than soft redirects. If anything the "Morpheme" header would be more appropriate, except that most Korean linguists agree that many Hanja used in modern Korean are not genuinely productive morphemes in modern Korean, especially given the decline of Literary Chinese education. So Hanja is really the only header that fits.--Tibidibi (talk) 16:10, 2 June 2021 (UTC)[reply]
@Tibidibi True, hanja in modern Korean are not logograms. They're just a different way to spell Sino-Korean morphemes, as you say. So, for example, 椅子 is just a different spelling of 의자. In modern Korean it's just a question of stylistic choice. Sartma (talk) 16:48, 2 June 2021 (UTC)[reply]
“determinative” should probably be added since taxogram and semagram are much less used and classifier seems restricted for a thing that is used with numerals, though determinative has another meaning we list and I personally prefer taxogram and semagram because these elite words are unambiguous and parallel to other -grams, and I see semagram is used with another meaning by word-gamesters (the one I knew first we don’t have yet, as with heterogram, I’m finna fix it). Fay Freak (talk) 16:47, 1 June 2021 (UTC)[reply]
Here too, I'm not against it in principle, but for the same reasons I'd prefer to keep Sumerogram/Akkadogram, I'd prefer to keep Determinative too. This is the word used in every Akkadian and Sumerian reference material (Dictionaries, textbooks, essays...); it would be confusing/alienating if we used something unusual in the field. Sartma (talk) 19:22, 1 June 2021 (UTC)[reply]
Support the original proposal. Nobody actually uses heterogram when working on these languages, so we'd just be causing confusion for no gain; it's not like we have a finite number of L3s we can use. —Μετάknowledgediscuss/deeds 16:46, 3 June 2021 (UTC)[reply]
Support 110521sgl (talk) 19:52, 3 June 2021 (UTC)[reply]

definite/indefinite articles again

Hi @JoeyChen, I asked on your discussion page why you removed grammatical articles from glosses. You didn't answer and you insist on continuing to do it: Special:Diff/62639656. Then I raised the issue in BP last month and no opinions were offered in favour of your practice, but neither were any firm and clear guidelines offered against it. I think such a fundamental disagreement deserves a coherent discussion - perhaps even a vote? Surely it can't be that difficult to decide. Please engage. Brutal Russian (talk) 19:39, 2 June 2021 (UTC)[reply]

I can see why they're doing it. I'm quickly going to look at a physical dictionary for Latin real quick. Oh. I thought I remembered there being indefinite articles in it, but apparantly dictionaries don't give articles for Latin nouns. So JoeyChen's doing it right. (Woordenboek Latijn/Nederlands zevende herziene druk Amsterdam University Press, 2018) 110521sgl (talk) 14:35, 3 June 2021 (UTC)[reply]
@110521sgl:"Right" in the context of wiktionary is what corresponds to our editing policies/guidelines. These can be influenced by what other dictionaries are doing, but what other dictionaries are doing does not determine what we consider to be "right"; moreover, if one wants to determine what other dictionaries are doing, consulting just one isn't enough. {{R:OLD}} uses articles the way our English definitions use them, definite and indefinite; it does present its definitions as sentences finished by a stop. {{R:L&S}} seems to be inconsistent, but does use them in the same word; see further "FriezeDennisonVergil" on the same website (at the top, only for words used by Virgil). It seems to me that the way English speakers choose to present English definitions is a good guide to the natural way to present them, and this would make article-less glosses aberrant. In addition, translations in templates like {{m}} generally require the use of articles to distinguish parts of speech, and it's simply better when the definitions in these templates consistently reflect the definitions in the entires. Otherwise, why not gloss verbs without the to for ultimate confusion? Finally, do you really find it desirable to gloss eg. cantiō, pugna as "singing", "fighting"? If not, what's the point of making an exception for disambiguation instead of introducing a general rule? Brutal Russian (talk) 21:50, 3 June 2021 (UTC)[reply]

CFI for foreign languages should be spelt out

Hi. I think the CFI for foreign languages is not clearly spelt out. If I understand correctly, the present consensus followed is:

  1. English Wiktionary should have entries for all foreign natural language words that exist in the foreign natural language. The definitions and descriptions should be given in English. Title should be in the foreign script.
    • Entry layout for foreign terms is identical except that it should not have Translations sections
  2. Foreign translations of English words should be added to the translations section of English entries.

I had long time not contributed anything to Wiktionary as I was unsure to what extend can foreign languages be added. Note that English WT has very less Indic language content (only 14,000 Hindi entries and just 1,400 Malayalam entries) despite their respective language versions of WT have over hundred thousand entries. WT:Statistics. So, please clear out the inclusion criteria for non-English languages in the CFI and other policy pages. I wish that a WT:Foreign languages page will be created. Thank you! Vis M (talk) 22:43, 3 June 2021 (UTC)[reply]

The main blocker is the nominal policy that we shouldn't just include an alleged word because some other dictionary has the word. The other is that we need usable translations into English of these words. Strictly, for non-English words, we give translations rather than meanings, though I think a lot of editors go for meanings rather then translations. There's no policy reason why English Wiktionary shouldn't include most of those lemmas - the reason is shortage of labour. On the other hand, there can be policies excluding some very obvious inflections, as for English. --RichardW57 (talk) 05:49, 4 June 2021 (UTC)[reply]
Ok, thank you very much! Vis M (talk) 00:24, 6 June 2021 (UTC)[reply]
The inclusion criteria for English are the same as for all other well-documented languages.__Gamren (talk) 23:32, 10 June 2021 (UTC)[reply]

Multiple click policy for subordinate entry

There is a policy, though I don't know that it is documented, that the collection of meanings or translations for a word are kept at the main entry rather than duplicated across alternative forms or inflections. What, then, are the allowed uses of the gloss fields in many of the linking templates, such as {{inflection of}}, {{alternative form of}} and {{sa-sc}}, or indeed {{bor}}? --RichardW57 (talk) 05:35, 4 June 2021 (UTC)[reply]

One view, promulgated by Inqilābī, is that, "We provide the meaning in nonlemma entries only when there are multiple definitions in the entry", essentially that the purpose of these short glosses is to distinguish the lemmas. He's used this view to delete one of my brief gloss given for Pali သီလ (sīla, habit), which technically is a lemma, though some prefer to call it a soft redirect. (It actually stores script-specific information; by @AryamanA's rejection of the use of data-modules for word-specific information, irregular inflection can cause these subsidiary lemmas to involve a fair bit of work. Pali seems to have a host of irregularities.) --RichardW57 (talk) 05:35, 4 June 2021 (UTC)[reply]

I have been taking the view that these glosses can provide a one-stop service to the user who has temporarily forgotten the word; if he wants more meaning, he can click on, but if the reminder is enough, job done. So, may we attempt to be user-friendly by providing memory-jogging glosses? --RichardW57 (talk) 05:35, 4 June 2021 (UTC)[reply]

I also sometimes provide a brief gloss in non-lemma even when the lemma entry is unambiguous, especially if the main lemma is more than one click away (e.g. an alternative spelling of an inflected form or a mutation of an inflected form). —Mahāgaja · talk 15:08, 4 June 2021 (UTC)[reply]

User pages as self-contained lexicons

What do people think about User:Turkish Glossary of Untranslatable Expressions, and User:Términos de la psicología (earlier version User:Jimena rc)? While the subject matter is relevant to a dictionary, these are completely isolated from the rest of Wiktionary- none of these accounts have made any edits outside of their own user and user talk pages.

I'm probably going to delete the "psicologia" ones either way because the definitions are all in Spanish, but I think we need to discuss this- it's starting to look like a trend. Chuck Entz (talk) 21:19, 4 June 2021 (UTC)[reply]

I think they are in need of a kind of software for their vocabulary records, and are here because they have been conditioned to seek out a SaaSS. Fay Freak (talk) 02:24, 5 June 2021 (UTC)[reply]
This is an improper use of user names. As to the Turkish list: if attestable, why are these entries not simply terms/phrases in mainspace? (In fact, some are: (bacanak, cümbür cemaat, ellerine sağlık, kaçıncı, ulan/lan, üşenmek.) It is not at all unusual that some term has no direct equivalent in another language, or that some idiom does not make sense when translated word for word, or needs a usage note to explain when it can be used. The list has the appearance of having been copied from elsewhere, what with the remark “also seen in the photograph” while there is no photograph.  --Lambiam 17:13, 6 June 2021 (UTC)[reply]
My recent favourite "untranslatable" Spanish term is por el culo te la hinco. I was wondering how I'd translate that if it was in a film - probably have the character sing "Ah Ah Ah Ah Number Five Number Five" Beegees-style as a relatively humorous alternative. Indian subcontinent (talk) 20:52, 6 June 2021 (UTC)[reply]
...which probably explains why I'm not a film-script translator. Indian subcontinent (talk) 20:53, 6 June 2021 (UTC)[reply]
“In yer tewel I swive” Fay Freak (talk) 21:07, 6 June 2021 (UTC)[reply]
“In yo' mom's muff I dive” Indian subcontinent (talk) 22:54, 6 June 2021 (UTC)[reply]

English nouns lacking genitive form

The following pronouns have no genitive formː there & relative which. The following common nouns are not found with a genitive form eitherː umbrage, sake, dint, worth, behalf, lack, basis, extent, means, stead, shrift, spate, heed, & cusp. What's a good way to deal with this?--Brett (talk) 16:35, 6 June 2021 (UTC)[reply]

Maybe I'm just dumb but how is umbrage any different from anger? I can't see how a possessive or genitive is acceptable for one but not the other. —Justin (koavf)TCM 20:40, 6 June 2021 (UTC)[reply]
I don't see anything that need to be "dealt with", to be honest. Make a list in a subpage of your username, I guess Indian subcontinent (talk) 20:47, 6 June 2021 (UTC)[reply]
Better corpuses? Better analysis? You ought to find that 'whose' does function as the possessive of 'which'. As a mathematician, I have no problem pondering a basis's cardinality. (It's in print as, "In fact, for any two vector spaces A and B, we can always find a vector space C, whose basis’s cardinality is big enough, such that A ⊕ C = B ⊕ C.") And googling quickly turned up, "Then we consider the case of unknown cusp's order and derive an adaptive wavelet estimator with the uniform rate slower only by a log n factor than the corresponding rate for known ffi." I also found, "Business Insider calculated that Amazon CEO Jeff Bezos made $160,000 per minute at his net worth's peak September 2018".
If such a lack were real and noteworthy, the 'Usage notes' seem a sensible place to mention such a lack. --RichardW57 (talk) 22:22, 6 June 2021 (UTC)[reply]
"the richness of sake's taste" FTW Indian subcontinent (talk) 22:59, 6 June 2021 (UTC)[reply]
That's just taunting. --RichardW57m (talk) 11:06, 7 June 2021 (UTC)[reply]
"If first extent's measurement in EAD = boxes, enter boxes in ASpace type." --RichardW57m (talk) 11:06, 7 June 2021 (UTC)[reply]
I suspect certain verb forms also happen not to have a 'genitive form', such as 'am'. Do clitic forms of verbs take the possessive clitic, or does it force the clitic to decliticise? This question seems to be more of an issue for a grammar rather than a dictionary. The clitic's realisation seems to be variable after 'is' and 'was', even amongst those who have mastered the apostrophe. (The question is whether the 'repeated morph constraint' gets applied.) --RichardW57m (talk) 11:06, 7 June 2021 (UTC)[reply]

Template:P: for "pronunciation"

I asked for opinions on creating pronunciation usage templates back in March, but didn't receive any. Since then I've only created one ({{U:la:pron-dropvowel}}) because it rubs me the wrong way to create templates with monstruously long names. These result from the need to specify what type of usage template it is, for example. In my opinion the type is best distinguished by the capital-letter, and so I've just made {{P:la:4decl-neut}}, where P stands for "pronunciation". I'm not sure if Etymology needs its own letter, but no other sections that do come to mind, since the rest of the entry is basically treated as one section and the note generally appears under Usage notes. Do you think this is a good approach? Earlier-created pronunciation notes are often found in the Usage notes section, which I think is the wrong place for them, and I've been consistently putting them under Pronunciation. Brutal Russian (talk) 00:32, 7 June 2021 (UTC)[reply]

I'd go ahead and do it, and if it breaks anything someone will eventually realise. Also, don't worry about long names in templates - we have long-named stuff like Template:RQ:Chapman Mask of the Middle Temple and Lincoln's Inn and Template:RQ:Denham On the Earl of Strafford's Tryal and Death Indian subcontinent (talk) 22:35, 7 June 2021 (UTC)[reply]

Universal Code of Conduct News – Issue 1

Universal Code of Conduct News
Issue 1, June 2021Read the full newsletter


Welcome to the first issue of Universal Code of Conduct News! This newsletter will help Wikimedians stay involved with the development of the new code, and will distribute relevant news, research, and upcoming events related to the UCoC.

Please note, this is the first issue of UCoC Newsletter which is delivered to all subscribers and projects as an announcement of the initiative. If you want the future issues delivered to your talk page, village pumps, or any specific pages you find appropriate, you need to subscribe here.

You can help us by translating the newsletter issues in your languages to spread the news and create awareness of the new conduct to keep our beloved community safe for all of us. Please add your name here if you want to be informed of the draft issue to translate beforehand. Your participation is valued and appreciated.

  • Affiliate consultations – Wikimedia affiliates of all sizes and types were invited to participate in the UCoC affiliate consultation throughout March and April 2021. (continue reading)
  • 2021 key consultations – The Wikimedia Foundation held enforcement key questions consultations in April and May 2021 to request input about UCoC enforcement from the broader Wikimedia community. (continue reading)
  • Roundtable discussions – The UCoC facilitation team hosted two 90-minute-long public roundtable discussions in May 2021 to discuss UCoC key enforcement questions. More conversations are scheduled. (continue reading)
  • Phase 2 drafting committee – The drafting committee for the phase 2 of the UCoC started their work on 12 May 2021. Read more about their work. (continue reading)
  • Diff blogs – The UCoC facilitators wrote several blog posts based on interesting findings and insights from each community during local project consultation that took place in the 1st quarter of 2021. (continue reading)


placement of inline synonyms

Do inline synonyms/antonyms as specified using {{syn}}, {{ant}}, etc. go before or after usage examples? Benwing2 (talk) 01:39, 12 June 2021 (UTC)[reply]

Before, I think. Imetsia (talk) 02:19, 12 June 2021 (UTC)[reply]
It was left unregulated, I discovered shortly after the vote introducing them and you probably have read: But there one has argued for before. Which I now also prefer mostly because otherwise the quotes push away the semantic relations on expansion but you would like the synonyms and company near the definition to even understand the definition or you wonder where they went. Fay Freak (talk) 03:01, 12 June 2021 (UTC)[reply]
Given that almost everyone in that discussion wanted them placed before usage examples, and I agree, and WT:ELE agrees as well, I've changed the documentation of all inline *nyms to indicate that they go before usage examples. Benwing2 (talk) 04:23, 12 June 2021 (UTC)[reply]

Pinyin capitalization

@Justinrleung, Suzukaze-c, Tooironic, 沈澄心 Should the pinyin of the names of ethnic groups be capitalized here on Wiktionary? In 現代漢語詞典 they are capitalized even though they are classified as nouns. An example is Hànzú for 漢族. The same is true for 漢人, 漢語 and 漢字, but not 漢姓. RcAlex36 (talk) 11:25, 12 June 2021 (UTC)[reply]

Thanks for the ping. I don't have an opinion on this matter. ---> Tooironic (talk) 11:33, 12 June 2021 (UTC)[reply]
Also @Frigoris, Bula Hailan. RcAlex36 (talk) 12:02, 12 June 2021 (UTC)[reply]
@RcAlex36 See this link 汉语拼音正字法基本规则, in section 6.3.2, the Pinyin transcription of 景頗族 is capitalized. Bula Hailan (talk) 12:10, 12 June 2021 (UTC)[reply]
I don't think 現代漢語詞典 has a separate label for proper nouns, but even so, I think these should be capitalized. — justin(r)leung (t...) | c=› } 16:04, 12 June 2021 (UTC)[reply]
@Suzukaze-c, 沈澄心 Any opinion on this? RcAlex36 (talk) 05:28, 13 June 2021 (UTC)[reply]
I personally support capitalization but do not care if someone supports the opposite. —Suzukaze-c (talk) 05:31, 13 June 2021 (UTC)[reply]
With no editor opposing capitalization I will go ahead and capitalize the pinyin of words in question. RcAlex36 (talk) 09:24, 14 June 2021 (UTC)[reply]
@Atitarev made this change ([1]); I'm pinging that editor in case they have a comment. I don't have an opinion on it, but I will note that Xiandai Hanyu Guifan Cidian has no capitalized pinyin forms, but Xiandai Hanyu Cidian does (as noted above). --Geographyinitiative (talk) 12:32, 14 June 2021 (UTC)[reply]
Note section 6.3.3 in which the pinyin for 漢語 is written as Hànyǔ. Also, Xiandai Hanyu Cidian, being the primary prescriptive standard for Standard Chinese, should be considered more authoritative than Xiandai Hanyu Guifan Cidian in my opinion. RcAlex36 (talk) 14:19, 14 June 2021 (UTC)[reply]

WingerBot is making entries worse by changing {{l|en|a}} in definitions to [[a]]. The former is superior because it links to the intended definition, not to the top of a large page that happens to contain a definition. See Special:diff/62729111. Compare a to a. I have to scroll down 22 pages to reach the English section if I follow the page link. Vox Sciurorum (talk) 11:35, 12 June 2021 (UTC)[reply]

I'm not sure the first is superior, since it also uses Lua (which can be of significance in some pages). Also, the English section is always the first or second one on the page, so you can click it in the contents box. That said, I'm not sure machine-changing these is a good idea without consensus, since I can imagine some editors prefering the former style over the latter. pinging @Benwing2 Thadh (talk) 11:59, 12 June 2021 (UTC)[reply]
It may be tedious doing it by hand, but a bot should have no trouble using [[a#English|a]] to replace {{l|en|a}}. Chuck Entz (talk) 17:42, 12 June 2021 (UTC)[reply]
@Vox Sciurorum This has been discussed before and I think people were generally in favor of raw links for English. Generally this is how people enter the definitions anyway; it's annoying to enter templated links everywhere when creating definitions by hand (which is how it has to be done). The vast majority of pages for English words don't have large tables of contents at the top, and the English definition is almost always the top definition, so usually it's not an issue. If this is really an issue, we can use templated links only for the pages with large tables of contents. Furthermore, most of the time words like a aren't even linked in definitions; how many times do you need to check the definition of a word like this anyway? Benwing2 (talk) 18:01, 12 June 2021 (UTC)[reply]
I think plain wikilinks work in definitions, etymologies, etc, for Translingual terms as well, whether CJKV or taxonomic, except inside certain templates like those in the {{der}} family. DCDuring (talk) 18:32, 12 June 2021 (UTC)[reply]

Convert Italian noun plural forms to noun forms

Plurals are the only possible non-lemma forms of nouns in Italian, so there's really no point in having a category Category:Italian noun plural forms distinct from Category:Italian noun forms. For this reason, I plan to run a bot to convert all Italian 'noun plural forms' to plain 'noun forms' and remove the category Category:Italian noun plural forms. This would make Italian work like English and Spanish (which likewise have only plural non-lemma noun forms, which are placed in the 'noun forms' category directly). The same thing should be done in French. Benwing2 (talk) 01:31, 13 June 2021 (UTC)[reply]

Per above and Category talk:English noun plural forms § RFM discussion: March–May 2019. J3133 (talk) 01:45, 13 June 2021 (UTC)[reply]

How do we handle the interaction with the clitic -'s? In many ways, it still works like a case form. --RichardW57m (talk) 11:48, 14 June 2021 (UTC)[reply]

We only have a small handful of pages that are noun forms but not noun plural forms, and the change was already made for English; Italian doesn't have this issue. I support moving the Italian, French, and English proper noun categories. However there are still 179 categories in Category:Noun plural forms by language. Should they all be moved to "X noun forms" even if they have case systems? No one has complained so far about the 51,671 pages in Category:German noun forms versus the 174 in Category:German noun plural forms. Ultimateria (talk) 17:21, 14 June 2021 (UTC)[reply]
@Ultimateria I don't think there should be anything in Category:German noun plural forms. In general, "noun plural forms" doesn't really make sense for languages with case because there usually isn't a single plural noun form. German is a partial exception in that nouns with plurals in '-n' and '-s' have the same form for all cases, but I still don't see the point of a 'noun plural forms' category there. Benwing2 (talk) 04:03, 15 June 2021 (UTC)[reply]
@Ultimateria, Benwing2, J3133 It makes some sense when the plural has a separate stem, as most notably in Semitic languages. However, it is these stems that one would want to cpature. --RichardW57m (talk) 11:30, 15 June 2021 (UTC)[reply]

Italian numbers as adjectives

It appears that all Italian cardinal numbers are listed as both numerals and adjectives. The way it seems to have gotten this way is that User:SemperBlotto made all Italian numbers be marked as both nouns and adjectives around 2008, and User:Ultimateria converted the nouns to numerals in 2020, leaving the adjectives. I don't believe "adjective" is a correct POS and am planning on deleting the adjective POS from all of the numbers. Benwing2 (talk) 00:57, 14 June 2021 (UTC)[reply]

By all means delete them. Ultimateria (talk) 01:14, 14 June 2021 (UTC)[reply]
Agreed, although I wouldn't bother for some of the larger numbers. Many of the cardinal-number entries are in the process of being deleted outright (see the category talk page). So I don't think we should waste time first editing the categories for entries that will be deleted soon anyways. Although I do invite other admins to continue the work of deleting that large mass of cardinal numbers per our previous vote. Imetsia (talk) 01:21, 14 June 2021 (UTC)[reply]
numbers are considered adjectives in Italian though. Why would Adjective be wrong? Sartma (talk) 05:27, 14 June 2021 (UTC)[reply]

Wikimania 2021: Individual Program Submissions

Dear all,

Wikimania 2021 will be hosted virtually for the first time in the event's 15-year history. Since there is no in-person host, the event is being organized by a diverse group of Wikimedia volunteers that form the Core Organizing Team (COT) for Wikimania 2021.

Event Program - Individuals or a group of individuals can submit their session proposals to be a part of the program. There will be translation support for sessions provided in a number of languages. See more information here.

Below are some links to guide you through;

Please note that the deadline for submission is 18th June 2021.

Announcements- To keep up to date with the developments around Wikimania, the COT sends out weekly updates. You can view them in the Announcement section here.

Office Hour - If you are left with questions, the COT will be hosting some office hours (in multiple languages), in multiple time-zones, to answer any programming questions that you might have. Details can be found here.

Best regards,

MediaWiki message delivery (talk) 04:18, 16 June 2021 (UTC)[reply]

On behalf of Wikimania 2021 Core Organizing Team

Swedish common gender

Like the standard language, the swedish noun entries on wiktionary have two genders common/neuter. However, since almost every (traditional) dialect has the three masculine/feminine/neuter, it would be better to split the common gender template into something like c/f and c/m so both dialects and standard language would be accommodated equally in this aspect. There are also a few nouns that have different gender in different dialectareas. — This unsigned comment was added by ASkyr (talkcontribs) at 09:40, 18 June 2021 (UTC).[reply]

This is a good suggestion, and one that I myself have been thinking of making. There are still certain noun classes that have a strong connotation with feminine or masculine gender, for instance the nouns with -a in singular and -or in plural are historically feminine, and intuitively seen by natives as such, while the nouns with -e in singular and -ar in plural are likewise, but masculine.
Not to mention that the distinction between masculine and feminine was still largely existing in the written language of the 1600s and 1700s, which counts as Swedish and thus, being attested, should be included in the dictionary. I should add that SAOB, the Dictionary of the Swedish Academy, also lists nouns as "r. l. f." (common or feminine) and "r. l. m." (common or masculine), rather than only "r." (common) Mårtensås (talk) 18:58, 20 June 2021 (UTC)[reply]
I think this would be useful information, whether we put it in the headword line or at least in declension tables, and given what's been said above about how it's necessary for describing not only dialects but the early modern language, and how another major dictionary includes it, I'm inclined to include it. German entries sometimes mention a noun's varying gender in "dialects" via usage notes, but the number of Swedish nouns where this information would be applicable seems so high that it should go somewhere more "regular", like the headword line or declension table. (I suppose there may be some modern coinages/borrowings which don't have a traditional gender besides common, though, yeah?) - -sche (discuss) 21:16, 20 June 2021 (UTC)[reply]

{{bor+}} and {{inh+}}

Despite the failed vote, Creation of Template:inh+ and Template:bor+, @SodhakSH (and Brutal Russian) went ahead and created the templates {{bor+}} and {{inh+}}. They should be deleted and locked to prevent them from being recreated. @Metaknowledge, Thadh, Donnanz, Fenakhay, DannyS712, Fay Freak, -sche, Jberkel, Mahagaja --{{victar|talk}} 18:46, 18 June 2021 (UTC)[reply]

I agree, but this seems like a discussion for WT:RFDO, not the Beer Parlor. —Mahāgaja · talk 19:07, 18 June 2021 (UTC)[reply]
RFD created as well. --{{victar|talk}} 20:36, 18 June 2021 (UTC)[reply]
I think lengthy case presentings shouldn't be part of RFDs, so I'm going to give mine here, where it seems to be tolerated.
These templates' function is to make editing easier for a small number of editors (by name, mostly, but probably not exclusively, SodhakSH, Inqilābī and Brutal Russian), and arguably create a regular wording for etymology sections (although I, as well as others, dispute that). The necessity of the text that is now being displayed using these templates is very much disputed to a degree that a supermajority (13 to 5) has voted to abolish giving the text within the {{bor}} template. Of course, one could argue that adding the template {{bor+}} isn't contradictory to that vote, but seeing as the vote concerning adding bor+ also failed I wouldn't be so certain of that.
Some bring up the issue that AryamanA voted just past the time and thus failed to make the difference, but seeing as PUC was also going to vote oppose (mind you, an oppose vote is worth twice a support), the vote would have failed anyway.
Now, some have brought up that a new template's creation shouldn't need any vote, but I'd argue that since this template is one in a series of arguably most used templates (after {{head}}, {{l}} and {{m}}), any creation of a template that takes over a part of or even the whole function of {{bor}} or {{inh}} should get a vote, which it did in this case, and would have even without the initiative of the template's advocates.
So, to reiterate, the proposal to create these templates was turned down in a democratic process that is of the highest form we have. An RFD discussion is of no value, since it's not as important as a vote, and as such I, and anyone who agrees with me, plead to the administrators of this project to delete these templates, lock them and either create another vote (which I personally would find absurd, since we just finished this one, and we are currently not in the season where the majority of Wiktionary editors is regularly editing), or just ban the creation of such templates until a supermajority of Wiktionary editors actually agrees that this template should be created. I thank you for your time.@Victar, SodhakSH, Inqilābī, Brutal Russian, Mahagaja, Fenakhay, Imetsia, Benwing2, PUC, Lambiam, Andrew Sheedy, Bhagadatta, Chuck Entz Thadh (talk) 11:09, 21 June 2021 (UTC)[reply]

Categorization bot

I would like to create a bot to populate Category:English three-letter words (using the method outlined by User:Thryduulf) and Category:English terms with multiple etymologies (by checking for multiple etymology sections), and any other under-populated categories I find. Are these categories worth populating? Is there a better way to populate them than by using a bot?

I have experience with Python and from what I've seen I think I could make the bot using Pywikibot fairly easily. —TeragR disc./con. 02:01, 19 June 2021 (UTC)[reply]

Yeah, this seems like it could be done relatively easily. Though I am no competent expert, I'd say populating the categories could well be worthwhile (there are all kinds of categories out there, after all, many of which have less utility than this proposal), and a bot would probably be the best way to do it. Moreover, this kind of edit should be minor and easy enough to achieve, right? Have you already begun working on it? Support your idea :) Kiril kovachev (talk) 16:42, 20 June 2021 (UTC)[reply]
Yes, the edits would be simple and minor. I wanted to ensure there was consensus before writing the code, so I have not started that yet. —TeragR disc./con. 22:30, 20 June 2021 (UTC)[reply]

Please provide input here or on Meta and during an upcoming Global Conversation on 26-27 June 2021 about the Movement Charter drafting committee

Hello, I'm one of the Movement Strategy and Governance facilitators working on community engagement for the Movement Charter initiative.

We're inviting input widely from users of many projects about the upcoming formation of the Movement Charter drafting committee. You can provide feedback here, at the central discussion on Meta, at other ongoing local conversations, and during a Global Conversation upcoming on 26 and 27 June 2021.

"The Movement Charter drafting committee is expected to work as a diverse and skilled team of about 15 members for several months. They should receive regular support from experts, regular community reviews, and opportunities for training and an allowance to offset costs. When the draft is completed, the committee will oversee a wide community ratification process." (Creating the drafting committee)

Further details and context about these questions is on Meta along with a recently-updated overview of the Movement Charter initiative. Feel free to ask questions, and add additional sub-sections as needed for other areas of interest about this topic.

If contributors are interested in participating in a call about these topics ahead of the Global Conversation on 26 and 27 June, please let me know. Xeno (WMF) (talk) 16:48, 19 June 2021 (UTC)[reply]

The three questions are:

  1. What composition should the committee have in terms of movement roles, gender, regions, affiliations and other diversity factors?
  2. What is the best process to select the committee members to form a competent and diverse team?
  3. How much dedication is it reasonable to expect from committee members, in terms of hours per week and months of work?

Change "Korean Language Family" to "Koreanic Language Family"

As mentioned on the talk page Module talk:families/data a few months ago, in line with the current nomenclature, the canonical name of the language family that includes Korean, Jeju, and extinct languages from the Korean peninsula should be "Koreanic" and not "Korean." Glottolog, Wikipedia, Wikidata, recent literature & research, along with Wiktionary's own entry for Koreanic, list it as the title for the language family; thus, the change should be made officially in Module:families/data. AG202 (talk) 18:04, 19 June 2021 (UTC)[reply]

Support--Tibidibi (talk) 13:59, 20 June 2021 (UTC)[reply]
Pinging Korean editors. (Notifying TAKASUGI Shinji, Atitarev, HappyMidnight, Tibidibi, B2V22BHARAT, Quadmix77, Kaepoong, Omgtw15): Fenakhay (تكلم معاي · ما ساهمت) 16:06, 20 June 2021 (UTC)[reply]
Support, seems like there's no reason not to change this. Kiril kovachev (talk) 16:34, 20 June 2021 (UTC)[reply]
SupportOmgtw15 (talk) 18:15, 20 June 2021 (UTC)[reply]
Support: Koreanic is rare but established in the linguistic circle. — TAKASUGI Shinji (talk) 02:31, 21 June 2021 (UTC)[reply]
Pinging @幻光尘 since it was originally their suggestion in Module_talk:families/data that spurred this proposal here. Thank you! AG202 (talk) 07:48, 21 June 2021 (UTC)[reply]

I propose that only {{swp}} be used in dictionary entries to link to the corresponding Wikipedia page. Some editors do be converting instances of {{wp}} to {{pedia}}; but the problem therewith is that the latter templet is put beneath ===Further reading===: which is but a wrong practice! The heading is meant only for references (non-inline ones), while Wikipedia (or any other sister project) cannot technically be used as a reference. Therefor, {{pedia}} should be deprecated; and so should be {{wp}}, for it has fallen out of favour with many users. Thus, {{swp}} should be the only templet available for linking to encyclopedia articles. ·~ dictátor·mundꟾ 01:54, 20 June 2021 (UTC)[reply]

Wiktionary:Entry layout § Further reading (this was voted): “This section may be used to link to external dictionaries and encyclopedias, (for example, Wikipedia, or 1911 Encyclopædia Britannica) which may be available online or in print.” The claim it is wrong is a fallacy as you redefined it (“meant only for references”); rubbish from Inqilabi is not surprising. J3133 (talk) 02:43, 20 June 2021 (UTC)[reply]
I would further note that the following paragraph from there:
This section is not meant to prove the validity of what is being stated on the Wiktionary entries (the “References” section serves that purpose).
I have frequently put sources as to the gender of words under "References", though as I have been told (it felt more like an instruction) that 'we do not use inline references', I didn't even struggle to make the source of the gender explicit. It has annoyed me to see References changed to Further reading - I now learn that these changes were actually damage. --RichardW57 (talk) 13:51, 20 June 2021 (UTC)[reply]
I have personally never used {{swp}} and always use {{wp}}, and see it being used all the time, so I don't understand what you mean by it being "fallen out of favour". Thadh (talk) 11:17, 20 June 2021 (UTC)[reply]
@J3133: By encyclopedia I was referring to Wikipedia; WP cannot be used as a reference in dictionary entries, as it is only a sister project. That’s we have the templets {{wp}} and {{swp}} to link to the WP page. (By the way, you can make make personal attacks at me, but try to spell my name aright.) ·~ dictátor·mundꟾ 14:45, 20 June 2021 (UTC)[reply]
@RichardW57: ===Reference=== is used only for inline sources, whilst ===Further reading=== for non-inline sources. Otherwise both are the same. (See this entry for an example of an approximate use of the headings.) ·~ dictátor·mundꟾ 14:45, 20 June 2021 (UTC)[reply]
@Thadh: I myself have used* both {{wp}} and {{swp}}, but as I said, it has fallen out of favour with some editors (not I) as you saw in the diff. [* But lately I have been using only the slimmer version because some editors do be substituting {{wp}} with {{pedia}} (which I see as counterproductive).] ·~ dictátor·mundꟾ 14:45, 20 June 2021 (UTC)[reply]
“WP cannot be used as a reference”: try to read again, “Further reading” is “not meant to prove the validity of what is being stated on the Wiktionary entries (the “References” section serves that purpose)” (thus “References” and “Further reading” are not “the same”). Also that was a note of you making your own rules (as when you thought you can add new templates and new parameters and insisted no one can delete them, as was proved otherwise, before your ad hominems). “It has fallen out of favour with some editors”: Irrelevant, as {{wp}} is used by more editors, thus the opposite can be argued. J3133 (talk) 14:48, 20 June 2021 (UTC)[reply]
Here is point 5 (which passed) of the vote (Wiktionary:Votes/2016-12/"References" and "External sources") that implemented the difference between “References” and “Further reading” (then “External sources”, later renamed) before the naive Inqilabi[sic] makes a fool of himself:
Allowing the usage of "External sources" only in cases where other dictionaries and encyclopedias (including Wikipedia) are listed as suggestions of places to look, without serving as proof for specific statements in the entry.”
Wikipedia is explicitly mentioned as being allowed. J3133 (talk) 15:07, 20 June 2021 (UTC)[reply]
But in reality no one has objected to @Jberkel and others’ continual substitution of {{wp}} with {{pedia}} (I for one definitely think it’s counterproductive). You mostly edit English entries, so you may not be aware of the actual usage of ===References=== & ===Further reading===. And the vote mentions Wikipedia only because {{pedia}} was formerly used beneath ===External sources===; after this heading was renamed to ===Further reading===, the vote was not officially updated, but per our prevalent practice, WP cannot be used beneath ===Further reading===. (Note that our English entries, being the oldest ones here, are the least updated.) ·~ dictátor·mundꟾ 15:18, 20 June 2021 (UTC)[reply]
“per our prevalent practice, WP cannot be used beneath ===Further reading===”: You mean your practice, as this is not our practice for most of us. Others do not support this new practice you and presumably some others use (and thus this proposal will likely fail). J3133 (talk) 15:20, 20 June 2021 (UTC)[reply]

New iOS app based on Wiktionary - Vedaist

Hello! I wanted to introduce a new iOS English dictionary app that is based on Wiktionary data. Please check http://www.vedaist.com/ if you're interested. I currently show a very minimal meaning of a word. Noun, verb, adjective or adverb sections for English meanings are shown. Over time I'll be adding more features.

I wanted to acknowledge the great work all of you have put into building Wiktionary, and making it possible for me to build on top of your work. If there is any feedback for me, please reach out. Thanks! — This unsigned comment was added by Toucanvs (talkcontribs) at 08:31, 20 June 2021 (UTC).[reply]

Shouldn't you acknowledge Wiktionary on the app pages, to conform to the CC BY-SA 3.0 License?  --Lambiam 11:29, 20 June 2021 (UTC)[reply]
I don't have iOS, so I can't give much feedback on that front, but indeed, it would be good to credit the Wiktionary authors for your uses. ^^ I am a big fan of the adless, trackerless paradigm, though, so I hope that's something you'll never change :) Kiril kovachev (talk) 16:31, 20 June 2021 (UTC)[reply]
@Toucanvs: please add it to Wiktionary:Wiktionary-supported software (don't mind the deletion notice). – Jberkel 06:50, 21 June 2021 (UTC)[reply]

Request for bot consensus

Hello, I have recently been in the process of developing a bot for the purposes of auto-generating derived form entries based on Bulgarian noun conjugation tables. To put it simply: the bot fishes for declension tables across all noun lemmas fed to it, takes note of what derived forms come from what lemmas, and creates definitions based on that data. Once generated, the definitions are either appended to existing entries, as long as they don't already have a Bulgarian section, or a new page is created containing the entry it just generated.

The edits I have already made under my own account using the bot can be viewed here: diff, diff, diff, diff, diff

(apologies, I don't know how to link these properly)

My GitHub repository is linked here - if you have Python knowledge, please run through if you're interested and see if you can spot any bugs. There are a few other resources in there, such as one to help to understand the program more clearly, and some sample output data to show what the edits the bot's making would look like. There are also instructions as to how to run the bot yourself if you wanna trial it out and look for problems.

I additionally wrote a few paragraphs describing the method on my bot's user page, which, if you have any objections to, please let me know once again. If all goes well, I'll post a vote to get the bot approved sometime soon.

If you have any questions or doubts, please ask me for answers, and I will do my best to respond well. One final thing - understandably, few people on here, if any, will have experience with both the Bulgarian language and Python, so - if no one cares, I'll just apply to votes within a few days. However, if there does happen to be anyone interested who could audit for any mistakes, I would be highly grateful for your help. Thanks for reading!

Kiril kovachev (talk) 16:28, 20 June 2021 (UTC)[reply]

@Kiril kovachev: I fixed your links for you. Chuck Entz (talk) 16:49, 20 June 2021 (UTC)[reply]
@Chuck Entz Thanks ^^ Kiril kovachev (talk) 17:07, 20 June 2021 (UTC)[reply]
I don't like new pages being created when there's no evidence that the word has existed in any language. Some inflected forms are best left hiding in the inflection tables. --RichardW57 (talk) 18:50, 20 June 2021 (UTC)[reply]

This started when I fixed the entry capus that was a mix of mainspace Latin and Reconstructed, and that this user had edited under a fundamental misconception of how wiktionary entries operate. This user chose to conduct a discussion via an edit war and aggressively dismissive comments to their reverts. I started a discussion which I was forced to abandon when their replies turned into accusations, rudeness and rants in reply to what an imaginary me inside their head said to them.

—Then the same repeated for fōrmāticus. This time they supplied the Latin entry with an Old French pronunciation, changed the etymology in contradiction to all the etymological references, and even redirected the attested variant form as if it wasn't attested. They replaced a conjecture that the word represents an ellipsis, suppored by numerous references (#1, #2, #3) as if it was pure nonsense. They further replaced "Gaul" with "France" on the grounds that its "entirely anachronistic" because of a single period gloss. I again took this to a discussion. In it the user show their lack of understanding of historical linguistics, including but not limited to the difference between date of attestation and date origination, and conflating attested evidence and conjectured forms as if they had the same strength; the concept of anachronism in adducing a single period gloss in order to condemn modern usage anachronistic, and in appealing to nothing but the linguistic intuition of medieval Franks to define what is and isn't Latin even when specifically warned agains doing so, because they're forced to in order to avoid admitting that they have no familiarity with the modern linguistic side of the question. As well as lack of knowledge of history and of any awareness of the socio-political implications that Francia and other words used to describe these territories is/was fraught with (#1, #2; finally only a complete ignornace of the fact that the word Gallia continued to be used throughout the Middle Ages and even up to today (Gallomania, Gallo-Romance etc) can explain the categorical statements that it's anachronistic for the period.

—Now I'm perfectly aware how petty and ridiculous these matters are, and it's all the more frustrating to me something like this regularly results in a shit-storm when interacting with this user. The reason for these squabbles is obvious to me. The user in question is in the habit of making rash edits and statements, and they have an extremely vulnerable ego. This creates an unfortunate feedback loop of constantly being open to criticism that they can't take, and of having to hide their own droppings. For example, here they they directly accuse me of ignornace - spoiling the well - for revealing to them information they were ignorant of, and when they realise their folly they try to remove my replies: #1, #2. The same combination of ignorance and lack of foresight can be seen in the amusing quip "Romanistics isn't even the name of the field" - in fact, is exactly the same thing as with Gallia and cāseus, only in the latter cases the quip is effected by editing the page, and protests are countered with a shit-storm over nothing and an edit war.

This user is a core example of the "argument is war" metaphor run out of control and into pathology. He's reasonably informed, but in his world-view "knowledge is ammunition", and he uses it to demolish the enemy's "arguments are soldiers". This person equates his own ego to the country being defended, and his persona to the commander in chief, and so the very act of challenging him is perceived to be aggression. This is that unfortunate cases where the more the person knows, the worse they are to interact with. The ego in question is highly vulnerable and I have good reasons to believe is highly narcissistic - something they themselves realise. It is also highly inflated due to overestimating the degree of their knowledge. The net result of the above is that they see people who possess enough knowledge to question their own conclusions and undermine their imaginary authority as highly threatening and need to be proactively destroyed, humiliated and otherwise neutralised before they're able to deal damage to the vulnerable ego; and if the ego has suffered for whatever reason, revenge is exacted on the culprit. I've had quite a bit of experience dealing with and circumventing the "argument is war" metaphor, and I have enough of an understanding of the narcissist's mindest to be confident in my observations.

Their impulsiveness is exemplified by their editing habits which are disruptive to the website. In conversation this same emotional instability surfaces in knee-jerk spiteful replies followed by several hours of editing, often so that by the time you reply they'll have changed half of it. Take notice of the timestamps, as well as here from June 2 - this person's brain locks in a 9 hour-long pathological obsessive feedback loop and by the end of it is so fried that they cannot tell whether they're replying to your actual words, or to a strawmen figure of you. This obviously makes a conversation difficult if not impossible. Another part of this is replacing offensive words with a set of less offensive expressions that are still clearly fighting words. For example, when you read "mildly absurd", understand "moronic". This is often combined with externalising misattribution: "you vehemently deny" means "I'm furious about the fact that you deny", and "apopletic rage" means they're currently experiencing just that (this expression they changed three times).

They went on to continually fixate on me confusing the names of two languages (Breton and Welsh) with no relevant consequences - they were annoyed that I wouldn't verbally admit that I confused them - and further to attribute to me a ridiculous strawman position (#1, #2) to argue against while completely ignoring my protests that this isn't my position, if not being further incited by them. This was done as a thought-blocker designed to prevent understanding what I'm saying, which was that the word's form presents no evidence to the time of borrowing. And the nature of their last edits can be perfectly described by the expression "apopletic rage" and an all-out assault on my person, calling me a beginner in Romance linguistics and trying to discourage me from ever disagreeing or challeninging them ever again, in fact threatening to humiliate me if I do.

Overall, to me they seem to possess enough knowledge to reach correct conclusions, but not to possess enough humility to continually question their intermediate conclusions or even suspend them. Indeed, they possess the opposite trait that encourages them to hang onto their intermediate conclusions with utmost conviction and to dismiss anybody questioning them as both ignorant and an aggressor. Their behaviour of correcting themselves 30 times in a row is a manifestation of the same pattern as that which forces them to impulsively "correct" others - I know because I suffer from a milder variant of the same thought pattern. The difference in the outcome seems to be rooted in my constant self-doubt, factual self-checking and a much higher associated standard of certainty. And of course, my ego can take a far bigger beating.

My only other previous interaction with this user on this website also ended badly, although the start was promising of a fruitful exchange. Unfortunately I quickly understood that their purpose was to disrupt and win points against me. They tried to argue that the existence of one allophone justifies not phonetically transcribing allphony as a whole for a singular case where we have overt descriptions of Latin allophony by native speakers. Towards the end of it they resorted to using the fact that a rare, impressionistic, essentially phonemic and phonetically vacuous IPA symbol [l͈] for the "fortis L" can barely be found in some linguistics handbooks should be interpreted as it being appropriate to use this symbol in an actual phonetic transcription of a dead language which hasn't been shown to possess fortis/lenis consonants even phonemically; as well as to misinterpreting a super-detailed table because its author unfortunately chose to call an 'unspecified for backness' non-velarised L - IPA [l] - "dark", with a palatalised one to the left and full two further degrees of velarisation to the right, the proper IPA [ɫ] being called "darkest". They claimed that the author calling it "dark" overrules the author's actual articulatory description and entire discussion, and the fact the author chose to use a non-IPA diacritic to mark the vowel quality associated with it ([o]) was claimed to be incompatible with simply transcribing it as IPA [l], which stands for the non-velarised (neither palatalised) lateral approximant, is therefore wrong and therefore this user is right and has won and why won't I admit it already by submitting to their reverts.

This user uses references as binary crutches that either justify or disprove everything or nothing because they lack the tools to comprehend the matter in evidential terms and use the references to update their degree of certainty. From what I've seen they lack proper awareness of the distinction between phonological and narrow phonetic transcription. They tried "disproving" the detailed phonological study (and several specialised references) I provided in support of my transcription by citing a 3-line unsourced note on the matter in Sihler 1995's comparative grammar. This is bad enough, but the main reason I chose not to continue that discussion was because they became visibly pissed off ("brazenly denied", "recant your vehement rants") at their own inability to argue coherently and my ability to do so, and because they had clearly revealed their squabbling winner-loser mentality instead of that of a mutual search for truth, and so any willingness for a civil discussion was clearly gone. I also left their reverts in place.

In a following act of aggression in Module:la-pronunc, they replaced my Campanian reconstruction of a historical pronunciation with their Russell's teapot, which is another variation on what my reconstruction had replaced after several prior discussions. They did this despite being aware of at least some these discussions and participating in them under the name Excelsius, and despite knowing that I've been editing the module in a different direction. Again, they feel that having condenced a number of works into a wikipedia article justifies this behaviour - "I have the references, you lose".

I don't object to having a purely phonemic proto-Romance reconstruction (lend a hand if you can), but a phonetic proto-Romance reconstruction makes even less sense than phonetic Old or Middle English. They argue that proto-Romance represents a concrete spoken prestiege variety that was common to everywhere; they believe it to have been significantly later than Classical Latin. Yet those reconstructions that use Sardinian evidence to reconstruct proto-Romance, as I understand it, are forced to postulate the break-up of the PRmc. unity no later than the 3d (preferrably 2nd) century CE, which is to say it's post-Augustan Latin, but comparatively reconstructed; and the phonological difference between attested Classical Latin and the reconstructions are of the same nature as syntactic differences (the future tense or the full system of cases cannot be reconstructed). I see no way to postulate a prestige language variety common to the whole of the empire but dating to the 5th century, for example, and nor have I seen it postulated. The loss of Dacia in the 3d century alone precludes this, with Rumanian being intermediate between Sardinian and the rest in its conservativsm. This paradox - that what you get comparatively isn't what you see attested and thoroughly described by language standardizers - has been known for as long as the Romanistics has existed but is felt especially sharply today (Wright R. in Clackson J. 2011 p.64). It's understood comapartive evidence needs to be synthesized with attested Latin to arrive at likely reality, and this is what my Campanian transcription aims to do; this user mistakes pure reconstruction for a complete linguistic system that actually existed, a claim that most Romanists don't subscribe to. In essence it's an offshoot of Vulgar Latin, which was nebulous enough but at least inextricably linked with written attestations; the reconstruction they use is an excercise in purely comparative reconstruction and makes no claim at being a language in the sense of a sociolinguistic system, even less so at having a single supra-regional standard phonology that they're attempting to imbue it with.

In any case I'm sure I could have worked out these differences with a reasonable person - there are many things that could be fruitfully discussed. But this person is not reasonable, and is not interested in working out differences. They're interested in winning and in humiliating me personally. They take rash one-sided initiative in completely undoing other people's work, and they take being challenged on it as personal offense. They will edit-war you in an attempt to defeat you, to have you give up. I'm a relatively chill person averse to conflict, and so you'll notice that in every single one of those instances I simply bail out of the discussion when I see that reason and civility no longer reigns supreme (although sometimes I do loiter too long). But I cannot do this continuously because this user interprets this as a victory and they proceed to do whatever they like. The other options are to continue feeding oxygen to the fire, which is stupid, or to silently edit-war them, which is wrong. Therefore it seems to me the optimal solution is to simply ban them.

To summarize, never before have I been in conflict with anyone at all on this website, and I enjoy it for this reason among others. I don't ever participate in website drama, here or elsewhere. I do seem to be able to magically pop over-inflated egos with a few words, but most of the time I only slightly prick them to achieve silent deflation. This is because I highly value open-mindendess, civility and consensus as conductive to the acquisition of knowledge, something which an inflated narcissistic ego opposes by nature, being a priori right. For a full disclosure, I might have ended popping this person's ego 3 years ago or so when they stormed into our chatroom fresh from a notorious edgy meme- and teenager-infested place and tried to bring that attitude with them. It might have rendered them permanently jarred ever since: first they've tried trolling me into another conflict (the "go Skype" revenge tactic) before reappearing under several different aliases, all with the same result - roughly how this went can be gleaned from the pronunciation discussion referenced above (the last time I interacted with them anywhere must have been close to a year ago). I say nothing offensive, but merely show them that their pretences at being a Romance language authority are unfounded, and a wiser person would have learned humility from this, but at some level of volatile narcissism this seems to be ruled out. I don't hold grudges and have genuinely attempted to resolve disagreements with this user and show them where their reasoning was flawed. What I got in response was immediate aggression, deeply-rooted bad faith and general verbal military action - fully conscious, as his repeated abusive narcissist's clichés of "you made me do it" and "you started first" testify - until I too dropped any attempts to assume good faith. At the end of that discussion they've clearly demonstrated that they aren't interested in calm and reasoned discussions, that they don't even consider me worthy of being listened to, and to but it bluntly that they're emotionally disturbed. Personally speaking, engaging with them feels like being attacked by a rabid canine. If I seem like a reasonable person to you, if you don't believe I'm simply delusional, I would ask the admins to consider taking some action to prevent The Nicodene from causing further strife on this website. If I know anything about appropriate human conduct (and Wikipedia Etiquette), I believe their combination of traits results in unique toxicity that is detrimental to this website as well as to the atmosphere on it. Mentioning @-sche, Mahagaja, Imetsia, Metaknowledge as admins I've recently interacted with - hopefully this is appropriate. It's taken me a while to decide to write this because I don't believe in heavy-handed moderation and would hate to be seen as settling a personal matter via that route, but I don't see another choice. Brutal Russian (talk) 17:25, 20 June 2021 (UTC)[reply]

I am a user given to writing long comments. I have to try to be more concise. I want to reach out and suggest to you that you that a comment of this length is not consistent with the mission of a volunteer dictionary website. The very length of the comment is 'detrimental to the atmosphere' of the website. That's my comment and suggestion. Thanks for your time. --Geographyinitiative (talk) 18:29, 20 June 2021 (UTC)[reply]
Imagine what it's like being the one to have to reply and address all parts of it. Gets exhausting, frankly. The Nicodene (talk) 18:32, 20 June 2021 (UTC)[reply]
Nitpick: Doesn't Daco-Romance come from Moesia? --RichardW57 (talk) 18:40, 20 June 2021 (UTC)[reply]
Yes that is one of many issues with the above. It is assumed that Romanian must have originated north of the Danube even though there exists a considerable body of evidence to the contrary. Even if it had, though, that does not mean that linguistic contact between the ancestors of the Romanians and other Latin speakers had to cease the moment that Aurelian abandoned the province: the Danube is not going to prevent ongoing trade, proselytism, etc. The collapse of Roman rule throughout the interior of the Balkans in the early seventh century, in the face of the Slavic-Avar invasion, appears to be the turning point. The Nicodene (talk) 18:53, 20 June 2021 (UTC)[reply]
@RichardW57: I'm not informed on that question, although I'm currently reading something that mentions it, at it looks to be a distinct possibility. I'm very cursorily familiar with Rumanian. If I decided to take Geographyinitiative's advice, I might have substituted that whole part with: "Hadrian (76–138 CE) is reported to have had a Spanish accent and Septimius Severus (145–211 CE) a distinct African accent (his sister couldn't even speak Latin so was sent away from Rome as a disgraziata). Clearly no supra-regional prestige proto-Romance phonology gave rise to Ibero-Romance, nor to African Romance. Augustine was mocked for his accent when in Italy, presumably in early 5th century Milan, to no surprise of mine. One possibility is that somewhere between the early 5th century and the break up of the empire, a supra-regional prestige proto-Romance phonology took root across the whole of the empire. The other is that there never existed a supra-regional prestige phonology save for the upper-class Roman speech of the 1st century BCE, aka Classical Latin." Russell's teapot continues to elude us. Brutal Russian (talk) 19:58, 20 June 2021 (UTC)[reply]
It is rather that by the fifth century at least two clear regional differences had developed in the pronunciation of Latin, as demonstrated by Adams (namely the extent of betacism and treatment of the original Latin /ĭ/). That much is not controversial.
The phonological system reconstructed by modern scholarship (and labelled as 'Proto-Romance') existed prior to that. The Nicodene (talk) 20:27, 20 June 2021 (UTC)[reply]
Septimius Severus lived in the 2nd century and had an African accent. Hadrian had a Spanish accent in the first century even despite being from a patrician Roman family settled in Iberia shortly after Scipio Africanus' conquest of Italica (206 BCE). There already existed a distinct Spanish Latin in the 1st century CE, and nothing seems to show that it ever went away. The sociolinguistic situation in the Roman empire seems to have been highly complex at every single point in time, and this clashes with notion of a single supra-regional Late Latin/proto-Romance in much the same way it clashes with traditional Vulgar Latin. Brutal Russian (talk) 21:35, 20 June 2021 (UTC)[reply]
Variation in regional accents certainly existed at all periods, as in any sufficiently widespread language, including in the Classical period. That does not prevent linguists from reconstructing a relatively 'Late' pronunciation any more than it prevents them from reconstructing a coherent Classical pronunciation, which you have never once questioned. Note that the pronunciation of Rome was prestigious in the Late period, as Adams (2008: chapter III § 1.2) demonstrates, just as it had been in Classical times. The same author also debunks claims of an early distinctive 'Spanish Latin' (pp. 370–431), among numerous other topics relevant to this discussion. The Nicodene (talk) 23:08, 20 June 2021 (UTC)[reply]
@Brutal Russian This will come as no surprise to you, but I disagree with every last paragraph of your gargantuan rant. I do not, however, want to spend the rest of the day systematically dissecting the entirety of it and citing all the contradictory evidence. I have other things to do in life as well. I suggest you pick a specific section of it, and we can start from there. Once that is done, give it at least a few days, and we can move on to the next part.
By the way, if you want to claim that someone is bullying you, it doesn't help your case when you're the one constantly writing new rants like these in 'public' discussion venues and calling the other person ignorant, a narcissist, etc. and even trying to get them banned. The Nicodene (talk) 18:43, 20 June 2021 (UTC)[reply]
@Brutal Russian Here is what I would like to say in your response to your 'rabid dog' comparison.
You were toxic to me in the very first discussion we had on this website, without the slightest reason to be, and you have never stopped being so since that moment. (You even continue to insult me on this page.) How is it that you cannot see that you, the one regularly attacking me on new discussion pages and now even attempting to ban me, are not the bully?
I barely have the energy to do even a quarter of the work I used to do on this website, thanks entirely to you. You have succeeded in turning my experience here into a living misery. I hate logging on here now, because I have to constantly deal with the nauseating feeling that you are about to launch yet another tirade against me in some new corner of the website. Let me be even more honest (at the very real risk of you using this as ammo to ridicule me): I suspect I am developing a form of PTSD from your constant attacks. If I put on a façade of unaffected bravado, that is entirely for the benefit of the discussion. The Nicodene (talk) 09:57, 21 June 2021 (UTC)[reply]

Regarding an old discussion about allophones of /ll/

Today I will be focusing on one topic out of the lengthy rant that @Brutal Russian has posted above.

It relates to an old discussion we had about the topic mentioned in the title of this post, which was also the first contact that this user and I had ever had on Wiktionary.

As you can see, the discussion was civilized until he said “[you’re] simply muddying the waters while milking me for knowledge/conviction points” and “In fact, let me ask you directly: had you read anything on the topic besides Vox Latina before starting this exchange?”

He insinuated, in that paragraph, that I was ignorant, and that he, the all-knowing one, was annoyed by my having a discussion with him. I had done nothing to deserve this behaviour from him, yet he threw it at me anyway, in the process rendering void any future claims of innocence and victimhood.

This attitude of his might have been, at least, slightly less egregious if he had been right about what he was saying, but in the end he was not.

He appears to challenge the basic fact, which any linguist is aware of, that narrow transcriptions vary in their degree of precision. Moreover, few such transcriptions attempt to exhaustively transcribe all of the allophonic features of a given segment; that is especially so for reconstructions of extinct pronunciations, where doing so becomes exponentially more complicated.

He claims that that Classical Latin /l/ before /e/ was “non-velarised”, yet his own source (Sihler) clearly states that it was, just to a lesser degree than before /a/ or back vowels. (Just because a feature is underspecified does not mean that it does not actually exist.) This is supported by the scholars Scen and Weiss, who, like Sihler, out that /l/ was ‘pinguis’ in the environment in question. The Nicodene (talk) 19:54, 20 June 2021 (UTC)[reply]

I said: "When you're suggesting transcriptions devoid of meaning, arguing that we shouldn't aim for precision because some transcriptions are just wrong, to me it seems like simply muddying the waters while milking me for knowledge/conviction points". You're chopping my words to misrepresent my tone and intentions. With the full quote it's obvious that I'm being duly polite and circumspect. I'm politely but firmly letting you know that your behaviour is inconsistent with what I understand to be a bona-fide discussion with the aim to reach consensus, but that you seem to have a pre-conceived agenda to disagree with me. It's silly to pretend now that I wasn't right, and that I didn't burst your pretence bubble with that remark, which made your blood boil. Nothing stings one's ego more than truth, and the sting is felt even by those who lie to themselves about what they're doing.
Your next-to-last paragraph is precisely what I call muddying the waters and milking me for conviction points. There is no facts in that paragraph - the is only whataboutism of a somewhat higher level as "nobody has heard recordings of Classical Latin so we don't know cēna wasn't pronounced as in Roman Ecclesiastical - at least have the humility of not claiming that it wasn't!" or "you haven't seen the Earth from space so it's difficult for you to claim it's oval - at least have the humility of not claiming that it's not flat!" - you get the gist. People who use that tactic aren't people I want to have a discussion with. I have provided you with a detailed phonological study right from the very start - Sen 2015. I did not speicifally expect you to understand it without prior training in autosegmental phonology - but I did expect you to not argue about something you don't understand, a reasonable expectation I and other reasonable people apply to myself and everyone else.
Sihler can state anything he likes, but his statements cannot possibly disprove the study I cited. His statements don't have magical powers to disprove a phonological paper. Weiss treats the question of its quality before /e/ in a footnote - and that footnote doesn't disprove anything in Sen's 2015 study, the screenshot of which you took yourself. Weiss offers different conclusions that he borrowed from earlier non-phonologica studies. Is Sen the same person as your Scen? If so, then one and a half year ago you continue talk about the very study from where you took the screenshot that says what I say it says, and claim that it contradicts what I say it says. This is what made me decide you were one of several things neither of which I wanted to have anything to do with, and I continue to see confirmations for this. Why are you saying that what's written is not what's written? How is one supposed to discuss such intricate matters with a person who is either unable to understand even an illustrated phonological description, to agree even about what the English text says?
If the situation has improved since, and you instead mean Cser, in his 2020 book he writes "it was velarised before consonants, velar vowels and possibly [e]", presumably basing this on the same Sen 2015 which he cites elsewhere albeit not for this statement; and again his is not a phonological study of the Latin /l/ and does not disprove anything. The bottom line is that you're throwing generic references at me that make generic statements based on generic musings instead of providing anything relevant to the specialised phonological study that I cite. You provide no evidence - your own or external - that questions Sen 2015's scholarship or conclusions in any way. You seem to not understand how a scholarly exchange of knowledge works.
This is what I call throwing references at people, muddying the waters, whataboutism, having no knowledge or understanding but still disagreeing. Why? Why, to disagree of course! Once you've started a war, you don't want to just stop the barrage just because your missiles hit a brick wall, right? This is what I mean when I say that you use references as binary crutches regardless of whether or not you understand anything written inside. It's like a soldier not looking inside the rocket in their rocket launcher - as long as it destroys the opponent, it's all good and fair. Fire away and don't worry about it. And in that discussion you continued doing exactly the same with the fortis [l͈]. This is when I concluded I wasn't taking part in a discussion between two intelligent people. As is obvious by the fact that I didn't know who I was conversing with, I did that purely on the merits of your conduct, without any personal preconceptions against you, and the same happened on previous occasions with you using different aliases.
Do you still, despite my explanations on the talk page and in the longread above, not understand that the sound Sen 2015 puts between the patalatised [lʲ] and the velarised [ɫ] is good old IPA [l]? Do you not understand that its understanding the phonological description that allows one to conclude what the author is talking about, and that once you have the phonological entity, it's irrelevant to you whether calls that entity "dark", or comes up with adventurous transcriptions like [l°] (which I probably would have used it was standard IPA)? Brutal Russian (talk) 21:07, 20 June 2021 (UTC)[reply]
@Brutal Russian The fact that you interlaced your rudeness and insults with polite-sounding language does not suddenly mean that they did not exist. The fact remains that you were toxic to me without the slightest provocation, and anyone reading this will be able to see that. The fact that you continue to insult me in this most recent comment by calling my ego fragile and calling me ignorant for the umpteenth time ("having no knowledge or understanding") is just further confirmation that you are incorrigibly toxic.
(I do not understand your preoccupation with my 'aliases', by the way. Previously I, like many people, perhaps most people, had different usernames on different websites. Recently I decided to harmonize them all to Nicodene. For Wikipedia that name was already taken, so I had to add 'The' before it. None of this has anything to do with you.)
Back to the matter at hand: if you believe that narrow linguistic transcriptions all must transcribe every single allophonic feature then, I am sorry, but you are simply mistaken about this. Look at any narrow phonetic transcription of any word in any source, particularly ones of dead languages, and you will see that they only transcribe a certain number of features. If they covered everything, they would have to always specify tone (even for non-tonal languages), secondary/tertiary stress, minute degrees of asperation/palatalizion/velarization/retraction/nasalization, etc. The resulting transcriptions would be, to put it simply, absolutely chock-full of diacritics: at least one on every single symbol. Such exhaustive transcriptions are occasionally done, but they are the exception, not the rule.
In point of fact, extremely narrow transcriptions are often given in double square brackets, rather than single ones. Please enlighten me about how this would be possible if all narrow transcriptions must be maximally precise.
According to the ‘super-detailed table’ that you reposted, Sen (2015: 33) is quite clearly and unambiguously saying that /l/ before long or short /e/ was “dark” and “underspecified for back”, just like /l/ was before long or short /a/ or /o/ (which you yourself had transcribed as [ɫ]), just to a lesser degree. Meanwhile you transcribed /l/ before /e/ as a simple [l], which is in direct contradiction to what Sen is saying. Before you attempt to claim that I am misreading the table, refer to the following comment he makes on page 28: "To conclude, /l/ before /e:/ was dark enough to trigger backing to /o/, whereas /l/ before /e/ was darker, triggering backing to /u/", and on page 16 he adds that "Traditional grammars disagree as to which variant appeared before /e/, but colouring indicates that /l/ was relatively dark in this environment.”
Sihler (1995: 174) says that “The distribution was as follows: l exilis was found before the vowels -i- and -ī-, and before another -l-; l pinguis occurred before any other vowel; before any consonant EXCEPT l; and in word-final position.” On page 41, in a parenthetical comment, he specifies that what he means by l pinguis is, in fact, velarized l: "Before a velarized l (that is, l pinguis, 176a)".
Weiss (2009: 82) specifically says “In Latin, l developed two allophones: a non-velar (possibly palatal) allophone called exīlis be­fore i and when geminate […] and a velar allophone called l pinguis elsewhere […] The one slight surprise in this distribution is the fact that l is pinguis even before e, e.g., Herculēs < Hercolēs".
There is no ambiguity here. The sources directly contradict your transcription [l]. The Nicodene (talk) 22:18, 20 June 2021 (UTC)[reply]
To be even more explicit on the last point: [l] specifically refers to clear l, not to dark l. The latter is defined as being, to a greater or lesser degree, +back/'dark'/velarized. This can be seen by looking up the definitions or even just descriptions of clear l and dark l in any scholarly work. The Nicodene (talk) 02:58, 21 June 2021 (UTC)[reply]

Regarding formaticus

I did say that I wanted to leave 'at least a few days' between such discussions, but I will contradict this, for the moment.

The user @Brutal Russian posted a rant above in which he alludes to a discussion we recently had about the word formaticus.

In his recent comments, as well as his older ones, he repeatedly claims that using the term France for the late eighth century is against ‘scientific usage’. This claim is simply not true: modern scholarship prefers the term France, as can be seen by the fact that Google results for "Carolingian France" outnumber those for "Carolingian Gaul" by a ratio over six-to-one, as of right now (searching both terms with the surrounding quotation marks). I did, perhaps, overstate the case, but the basic point I was making remains true. Nevertheless I decided to go with the phrasing "what is now France" in the end in order to avoid a continuation of this frankly pointless argument over labels. Apparently, though, he cannot bring himself to let go of it.

He insults me, in the rant above, by saying that I show a ‘complete ignorance’ (I have lost track of the number of times he has called me 'ignorant') of the fact that the Latin word Gallia never truly died out, even in the early Medieval Ages, in the most polished, classicizing varieties of Latin, and that modern linguists use such terms as Gallo-Romance to describe the language family that includes modern French, Occitan, and Arpitan. I am well aware of both facts: neither contradict the fact that modern historians prefer to speak of Carolingian France. (Not to mention, of course, contemporaries.)

He mentions that I removed the caseus formaticus conjecture. That is true, I did, but I eventually came around to including it, this time clearly wording it as a conjecture rather than an indisputable fact, since caseus formaticus is not actually attested, nor is the fact that formaticus shows up as a masculine proof of such an ellipsis, cf. the dozens of Merovingian and Carolingian-era coinages ending in -aticus, many examples of which I provide in the discussion, none of which are the products of an attested ellipsis.

Speaking of the latter, he claims that I "conflat[ed] attested evidence and conjectured forms". That is not true: he simply misread what I really said. I quite clearly described coraticus, etc. as the etyma of various Old French words in the original comment. The fact that he apparently understood etyma to mean 'attested written forms in Latin' is his own fault, not mine. I did, by the way, provide a total of nineteen unambiguous attestations of such masculine forms (with -aticus/os/i), and I linked a page on which Du Cange lists several more. I could, if needed, expand the total to, say, sixty with a few more hours of research, and perhaps a hundred given an entire day to do it, but this would be a waste of time on my part: no amount of evidence would ever lead 'brutal russian' to concede the point. Nineteen sourced attestations, plus a link to about a dozen additional attestations by Du Cange, is plenty for any remotely reasonable person.

In response to my pointing out that nobody prior to the ninth century distinguished Latin and Romance as separate languages, he says that I am “appealing to nothing but the linguistic intuition of medieval Franks to define what is and isn't Latin”. This is not the case: it was, as I pointed out in the discussion, not just ‘medieval Franks’: nobody, no matter whether they were a native speaker or not, no matter which century they lived in (as long as it was prior to the ninth) ever referred to Latin and Romance as separate languages. As elsewhere, 'brutal russian' is twisting my words here.

He implies that I “have no familiarity with the modern linguistic side of the question”. This is just more toxic slander on his part: I am perfectly aware of the numerous opinions and arguments both in favour of the old diglossic model (which is very much becoming a minority view in the latest scholarship) as well as arguments against it.

He says that he revealed “information [that I was] ignorant of” in reference to his claim that spoken Latin lacked a nominative case in inanimate nouns. The source he cited, Ledgeway, says absolutely nothing of the sort, nor does any other source, and furthermore the fact that Old French and Old Occitan both had nominative forms of inanimate nouns is in direct contradiction to that claim.

(Here 'brutal russian' thinks that I intentionally deleted his comment. I did not: I simply replied to it, then sighed at the thought of developing yet another branch of an already massive and over-complicated discussion, and decided to delete my own comment. It seems I deleted his comment together with my reply. That was not intentional on my part whatsoever. Nor is it an example of him actually being right about the topic that was being discussed: as I have explained here, and as I explained in more detail on the discussion page for formaticus, he has completely misread Ledgeway, who in no way makes the unusual claim that 'brutal russian' somehow divined from his work.)

As for ‘Romanistics’ it is, at best, an extremely obscure term. Moreover, even when it is used it seems to often refer to the study of Roman culture, law, or other such topics. In the work titled Lesser-used Languages and Romance Linguistics (which is, incidentally, the very first result on Google Books when one searches ‘romanistics’) it is quite plainly stated that ‘the word Romanistics is regrettably not current in English’. Many, perhaps nearly all, of the ‘linguistic’ results seem to just be used in reference to German or Iberian scholars as a literal translation of terms found in their native languages. Neither Merriam-Webster nor the Oxford English Dictionary, moreover, even mention that the word exists in English. If one searches for “romanistics” on Google (with the quotation marks) one is met with less than six thousand results, versus two hundred twenty thousand for “Romance Linguistics”, a ratio of of more than thirty six to one. The field is most certainly not called ‘Romanistics’ with any frequency at all by British or American scholars; the term which does not appear in any standard reference work written by them, such as the Oxford Guide to the Romance Languages or either volume of the Cambridge History of the Romance Languages.

‘Brutal russian’ accuses me of narcissism, which is an odd claim. I have no problem with admitting when someone else is right about something, and have done so, whenever it was actually the case, in my discussions with him. By contrast he refuses to concede any point, no matter how many sources are arrayed against him. I have never once heard him say anything to the effect of ‘Alright, that was true. I was mistaken about that.’ Refusal to admit any wrong is a major highlight of narcissism; so, incidentally, is wanting attention and pinging half of Wiktionary to witness your attempted ‘epic beatdown’ of somebody. So is being rude and toxic without the slightest provocation, as he was to me the very first time we ran into each other on Wiktionary.

One valid criticism he has made in this latest discussion (note how I, unlike him, admit when what the other person says is actually true) is that I have a habit of hitting ‘submit’ too early and then seeing numerous things that I want to revise in my comment. I will make an attempt to do that less.

'Brutal russian' claims that I am making a strawman when I point out that he believes formaticus existed in the Classical period. I will concede that it is true that he never explicitly said it did. If he truly never believed that it did (I continue to have my doubts), then good: I no longer have to spend energy arguing for the obvious.

I did point out, multiple times, and citing several elementary mistakes he has made, that he is a beginner in Romance Linguistics. It is strange to me that he would complain about this for two reasons:

1) That is objectively true. Nobody informed about Romance Linguistics would fail to know where the rusticam romanam linguam quote is from, nor would they claim that inanimates lacked a nominative in the sixth century, nor would they claim that a complete merger of /b/ and /w/ took place in Pompeii, and so on.

2) The comment in question came a full two days after he claimed that I was ignorant of Latin, during a bizarre rant about Nazis, “peeing on scooters”, and “meme-shitters”. (Yes, he really said all that. In the Tea Room, no less.) The quote in question was “[you can’t] assert your authority in matters of a language you can't even speak”, although even prior this he had repeatedly insinuated that I do not even know Latin. Needless to say, I can speak Latin: I have studied it for over fifteen years.

Relative validity of the comments aside, one should not complain when they slander someone and the other later returns precisely the same favour. The Nicodene (talk) 02:25, 21 June 2021 (UTC)[reply]

Regarding 'Campanian Latin'

Now I would like to address the matter of ‘Campanian Latin’.

Ah, where to begin.

The first draft ‘brutal russian’ made of this ‘Campanian Latin’ (which is apparently meant to reflect the speech of Pompeii, so up to the year 79 A.D.) included such adventurous features as a complete, universal lowering of Latin short /i/ to [e], a complete merger of /w/ and /b/ in all positions into the voiced bilabial fricative, and a nasalization of every vowel before a nasal consonant in syllable coda position (not just word-finally).

The only source he provided for these, or any other feature since (that was not already found in the Classical module) was a single citation of Adams, regarding the single matter of Latin short /i/, in which he apparently missed the point that the phenomenon in question only occurred with any regularity in final syllables, generally in verbs. I think this is about the fourth or fifth time I am explaining this (see Adams 2013: 58-61), but I don’t see any sign that he has actually understood it. In his most recent edit to the pronunciation model, he attempted to resurrect the feature exactly as he had tried to implement it originally.

If he really wants this ‘Campanian Latin’ to happen, he needs to provide sources that actually support all of the non-CL features that he wants to assign to it. So far he has not done so, apart from the one mistaken example I have mentioned.

Not only that, but in this latest rant of his (which can be seen above) he even ridicules the idea of backing up one's arguments with sources. He apparently thinks the fact that every single feature that I have added to my Late/‘Vulgar’ transcriptions is thoroughly cited, according to two or more reliable sources (all the features in question are sourced here and here) is some sort of childish debate tactic rather than a legitimate argument in favour of the transcription.

Moreover, he has not explained why his 'Campanian Latin' is in any way appropriate for reconstructed Proto-Romance lemmas on Wiktionary, which is what the 'vulgar' module is used for. Needless to say, a reconstructed Proto-Romance pronunciation is what is most appropriate for reconstructed Proto-Romance words. The Nicodene (talk) 06:42, 21 June 2021 (UTC)[reply]

In fact, let me propose a solution for this, @Brutal Russian.
If you make a separate sub-module for your experiment (leaving alone the 'vulgar' one that is primarily used for reconstructed Proto-Romance lemmas), and if you correctly* cite all the features you want to put in your ‘Campanian Latin’, I promise to never touch it.
*Not like that mis-citation of Adams (2013). The sources need to actually support what you’re saying. The Nicodene (talk) 07:50, 21 June 2021 (UTC)[reply]