User talk:CodeCat

Archives: 2009-2010 · 2011 · 2012


Thread title · Replies · Last modified
twee handen op één buik · 2 · 02:40, 22 December 2014
{{la-IPA}}/Module:la-pronunc · 2 · 22:37, 21 December 2014
MewBot · 1 · 00:04, 17 December 2014
[] · 1 · 13:46, 10 December 2014
{{diff|31047830|31029876|improper adjustment of inflection tags}} · 0 · 14:21, 8 December 2014
"volo" deletus est? · 1 · 18:08, 1 December 2014
Gender of a Middle Low German word · 4 · 01:58, 1 December 2014
Category:Requests for etymology · 2 · 08:25, 29 November 2014
*flauhs or *flauhaz? · 2 · 02:07, 29 November 2014
Categories that have failed RFDO · 3 · 01:47, 25 November 2014
Term placeholders/requests · 10 · 00:00, 25 November 2014
politically correct as a politically correct term · 2 · 19:30, 24 November 2014
Something for "News for editors" for November · 2 · 16:14, 21 November 2014
etymology citation · 4 · 23:24, 19 November 2014
{{temp|hu-suffix}} - Part 2 · 8 · 13:36, 19 November 2014
Reverting my edition · 1 · 20:51, 16 November 2014
How are you speeding up MewBot? · 2 · 15:30, 13 November 2014
Would the affix template you suggested work for this word? · 9 · 14:40, 12 November 2014
Sami/Finnic correspondences · 2 · 23:59, 8 November 2014
Sami/Finnic correspondences · 0 · 23:43, 8 November 2014

twee handen op één buik

Could you have a look at this, maybe? The etymology is a bit strange, although I can't say that it's wrong. I've never heard the expression, actually. But the etymology given is based on some nobody's opinion in an internet forum. That would have to be deleted at any rate, I guess. The legitimate source from onzetaal says nothing about "beuk", nor does this:

Kolmiel (talk)00:52, 16 December 2014

I don't really know anything about it either.

CodeCat01:16, 16 December 2014

Ok. In that case I'm deleting the etymology. It's based on useless sources and more reliable sources mention nothing of the same kind.

Kolmiel (talk)02:40, 22 December 2014

{{la-IPA}}/Module:la-pronunc

I just want to thank you for your edits last month to this module! I'm just learning Latin and IPA and I find it to be incredibly helpful. When I first came to Wiktionary I was annoyed about the short vowels' (especially [ɛ], [ɪ]) being imprecisely represented as, for example, /e/ and /i/, when Vox Latina and other places on Wikipedia say they have different quality, like the English short vowels. Now Template:la-IPA shows exactly what I expect, including smaller phonetic features (like ae̯ and ŋk, which are hard for me to notice with aural learning). Is there any plan to use a bot to phase out the old pronunciation templates? ~hint~ ;-)

Nice work.

P.S. Out of curiosity, what were your source(s)? Vox Latina or another book or website?

Philosophus minor (talk)03:01, 12 December 2014

I didn't actually make the template, I just improved it some. It already did all those things before I started, so I don't know.

CodeCat14:12, 12 December 2014

Sorry, my mistake! I see now that you didn't change the pronunciation. I assumed, once I saw the template after your edits, that you had improved its functionality, because I saw a lot of differences in the examples compared to what I've seen on actual word pages. Instead, it must be that very few of them use la-IPA, as seen in this search (6,563 pages matched) (compared to about 3,000 that use la-IPA), and so there are differences in how they are rendered. I think I've seen pages that transcribe ⟨ae⟩ as /aj/, not /ae̯/, which becomes confusing because the latter is used by the template but not used elsewhere.

Would it be possible then to have all Latin pages use la-IPA, so that precise IPA is given for them? I think that would make it much more consistent, and short vowels are the main thing that would be improved. The template should work for >90% of cases, but words like maior need to actually look like maijor.

Philosophus minor (talk)22:37, 21 December 2014

MewBot

Greetings! Just wondering if MewBot is still operational and if you had any intention of running it in the near future. The FeedMe page is getting quite long by now. Cheers.

Tempodivalse [talk]23:15, 16 December 2014

It's one of those things that I know still needs to be done, but keep... not doing? I don't know. Part of it is that the script I use to run it needs some serious rewriting, which just makes it less appealing to do.

CodeCat00:04, 17 December 2014


p.s. what is actually the difference between {{term}}, {{t}} and {{m}}? All three occur pretty randomly in etymologies from what I can see (apart from {{t}} for proto-languages)...

Hyarmendacil (talk)03:04, 10 December 2014

{{t}} is for translation tables, it shouldn't be used anywhere else.

The difference between {{term}} and {{m}} is only in the way the parameters are specified: the former takes the entry to be linked to as the first parameter and the language with lang=, while the latter takes the language as the first parameter and the entry as the second. {{m}} takes the same parameters as {{l}}, in other words; that's why many people prefer it. But there is still some disagreement over which one of the two to use, so everyone just uses whichever they prefer. My own preference is for {{m}}.

CodeCat13:46, 10 December 2014
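
The parameter difference described above can be sketched in Python. This is only an illustration under stated assumptions: the function name `term_to_m` is hypothetical, the call is assumed to be flat (no nested templates or links), and any extra positional or named parameters are assumed to carry over unchanged, which the discussion does not confirm.

```python
def term_to_m(call: str) -> str:
    """Rewrite a flat {{term|entry|lang=xx}} call as {{m|xx|entry}}.

    Hypothetical helper: it only handles calls without nested templates,
    and it treats every part containing "=" as a named parameter.
    """
    if not (call.startswith("{{term|") and call.endswith("}}")):
        raise ValueError("not a flat {{term}} call")
    parts = call[2:-2].split("|")[1:]  # drop the template name
    positional = [p for p in parts if "=" not in p]
    named = dict(p.split("=", 1) for p in parts if "=" in p)
    lang = named.pop("lang", None)
    if lang is None:
        raise ValueError("no lang= parameter to move to the front")
    # {{m}} takes the language first, then the entry (assumed: the rest follows as-is).
    new_parts = ["m", lang] + positional + [f"{k}={v}" for k, v in named.items()]
    return "{{" + "|".join(new_parts) + "}}"


print(term_to_m("{{term|haven|lang=nl}}"))
```

The only change is where the language goes: `lang=` disappears and its value becomes the first parameter, which is the same order that {{l}} uses.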

{{diff|31047830|31029876|improper adjustment of inflection tags}}

Please stop your bot doing this.

 — I.S.M.E.T.A.14:21, 8 December 2014

"volo" deletus est?

Hi MewBot:

I love your work and use it daily to help me learn Latin. Thanks! I cannot seem to get the entry for "volo" to show; perhaps it's been deleted? Can you recover it? FYI, and thanks! 28 November 2014

Do you mean the volo entry? That's appearing just fine for me, and the history shows that the Latin entry has been there for a while. Hope that helps!

‑‑ Eiríkr Útlendi │ Tala við mig18:08, 1 December 2014

Gender of a Middle Low German word

Hello, CodeCat,

I hope that you are doing well! :) I have a question: what is actually the gender of Middle Low German havene?

Greetings, HeliosX

HeliosX (talk)13:50, 24 November 2014

I would guess that it's feminine but I don't know for sure.

CodeCat13:50, 24 November 2014

This is interesting, because its Germanic root was feminine, but its New High German descendant is masculine. There is a New High German word Hafen m (pot), from Old High German hafan m (pot); perhaps it influenced the gender of Hafen (harbour)?

Kluge's old Etymologisches Wörterbuch says "the Middle High German habene [feminine]", the immediate predecessor of New High German Hafen (port), "corresponds phonologically to Dutch haven [feminine] (late Old English hæfene [and] English haven, maybe loaned from Old Norse hǫfn [feminine] "harbour"). Low German haven, Danish havn, Swedish hamn are [masculine]." ("Dem mhd. habene F. entspricht lautlich ndl. haven F. (spätangls. hæfene engl. haven, wohl entlehnt aus anord. hǫfn F. "Hafen"). Ndd. haven, dän havn, schwed. hamn sind M.")

De Vries' NEW entry for haven says "haven znw. v., mnl. hāvene, hāven v., mnd. hāvene v. m. ( > nhd. hafen m.)".

- -sche (discuss)00:54, 25 November 2014

Okay, thanks! :)

Is this listing of the etymology of New High German Hafen right?

*habnō, Proto-Germanic --> hāvene, Middle Low German --> Haven, Low German --> Hafen, New High German

Actually, I encountered multiple possible etymologies for the word, so I am trying to figure out which one is correct.


HeliosX (talk)16:35, 29 November 2014

The date at which High German borrowed the word is far enough back that it was technically a borrowing from Middle Low German (into Middle High German), but that is otherwise correct. I've updated Appendix:Proto-Germanic/habnō and Hafen to show the sequence.

- -sche (discuss)01:58, 1 December 2014

*flauhs or *flauhaz?

I am not sure whether or not to move Appendix:Proto-Germanic/*flauhaz to Appendix:Proto-Germanic/*flauhs. Vladimir Orel 2003 reconstructs this noun as *flauhs, but Kroonen 2013 reconstructs the same word as *flauhaz. How do you explain the umlaut in German Floh/Flöhe and Icelandic fló/flær? Is the plural vlooien of Dutch vlo also umlauted, or is the /ooi/ pronounced as a true diphthong? (I'm not familiar with Dutch orthography.) I can find neither a good Middle Saxon nor a good Middle Dutch dictionary in English to see whether the noun was umlauted in those languages as well (note the asterisk next to the Old Saxon and Old Dutch entries).

Nayrb Rellimer (talk)19:06, 28 November 2014

Dutch -ooi- represents a diphthong, yes. Dutch regularly inserts /j/ in a hiatus.

Umlaut in the plural doesn't have to be because of the consonant stem plural ending -iz. The i-stem ending -īz and the u-stem -iwiz can also trigger umlaut.

CodeCat00:31, 29 November 2014

I am aware that -īz and -iwiz also can cause umlaut in the plural, but -ōz and -ōs do not cause umlaut in these languages so this entry needs to be updated to explain the umlaut in both the descendant languages. The main issue is that I can't understand German or Dutch so I can't read most Middle Low German or Middle Dutch dictionaries.

Nayrb Rellimer (talk)02:07, 29 November 2014

Categories that have failed RFDO

I haven't tracked down the discussion, but both Category:Lojban nouns and Category:Lojban proper nouns were deleted as having failed RFDO. Aside from the merits of this individual case, I think we need to come up with a way of preventing MewBot from creating categories that are validly formed, but shouldn't exist (to use an absurd example: Category:Chinese class 1 strong verbs). Maybe an exception list?

Chuck Entz (talk)16:08, 23 November 2014

If MewBot doesn't create them, it doesn't stop someone else from doing so anyway. So the list would probably need to be built into the category tree modules somehow, so that the module can automatically tag or trigger an error when they get created.

CodeCat16:39, 23 November 2014

The obvious, simple, low-tech solution is to make a page that links to all of the categories that have failed RFD, a bit like WT:RFVA, and periodically check for blue links.

- -sche (discuss)01:21, 25 November 2014

That works too. But harder to maintain.

CodeCat01:47, 25 November 2014
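
The "list of failed categories, periodically checked for blue links" idea could look roughly like the Python sketch below. Everything here is illustrative: `recreated_categories` and `page_exists` are hypothetical names, and the existence check is passed in as a plain callable so the sketch does not assume any particular bot framework or API.

```python
from typing import Callable, Iterable, List


def recreated_categories(
    failed: Iterable[str],
    page_exists: Callable[[str], bool],
) -> List[str]:
    """Return the failed-RFDO categories that exist again ("blue links").

    `failed` is the exception list of category titles; `page_exists` is a
    hypothetical callable that reports whether a title currently exists.
    """
    return sorted(title for title in set(failed) if page_exists(title))


# Example with an in-memory stand-in for the wiki.
failed_rfdo = ["Category:Lojban nouns", "Category:Lojban proper nouns"]
existing_pages = {"Category:Lojban nouns"}
print(recreated_categories(failed_rfdo, existing_pages.__contains__))
```

The same list could serve both approaches discussed here: a bot can consult it before creating a category, and a periodic check can report anything on it that has been recreated by someone else.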

Term placeholders/requests

Re "Showing a request for a term is pointless. How are people supposed to know what term is meant?": From context. Or from any other information that may be indicated, such as the translation. See Zira.

WikiTiki89 20:57, 12 November 2014

I can't tell from context what is meant there. It seems more like a crossword puzzle to me. "Guess the word that means X and sounds like Y." Besides, the module is used very generally; we don't want to remove the error checking just for a few specific cases. It's not worth it.

CodeCat21:00, 12 November 2014
Edited by author.
Last edit: 21:05, 12 November 2014

That's the whole point. You might not know the word, but someone else who stumbles upon the page (perhaps by browsing through the category) might know. Either from the idea that it has to be similar to "Zira" or from the translation, or both. If it's an error, then it would still be caught by the editor or whoever is browsing the category. And who ever said it's only for specific cases? This could be used very widely. It hasn't been only because the template hasn't allowed it.

WikiTiki89 21:02, 12 November 2014

And what if it isn't?

CodeCat21:04, 12 November 2014

Not every error can be caught. One error I frequently make is {{m|he|מַיִם|máyim}}, which displays as: máyim. If I don't catch it when I'm editing, someone else will when they are browsing the page.

WikiTiki89 21:08, 12 November 2014

One tweak I would suggest is checking the namespace: people try out all kinds of nonstandard kluges in their user spaces- do we really want those cluttering up the request categories?

Chuck Entz (talk)03:46, 16 November 2014

politically correct as a politically correct term

I feel that that entry and category need to be linked in some way. It doesn't really matter how, but in some way. A note about the category in the entry and a link to the entry in the category's lede are two other options if we don't want to categorize politically correct as a politically correct term.

Purplebackpack89 00:10, 21 November 2014

I can see your point. If this were a topical category, that would make perfect sense, but topical categories link terms about something, while this kind of category links terms that are something. It would be like adding nominal to Category:English nouns (assuming, for the moment, that there weren't a separate noun sense): yes, nominal is about nouns, but it's an adjective.

Chuck Entz (talk)15:00, 24 November 2014

Couldn't we, for example, get the cat head to read "Terms related to being politically correct"?

Purplebackpack89 19:30, 24 November 2014

Something for "News for editors" for November

Tweenk (talk · contribs) created a new template, {{pl-decl-noun-cie}}, for Polish nouns ending in -cie because I somewhat requested it. Does the creation of a new template like that count as news for editors?

Lo Ximiendo (talk)05:09, 21 November 2014

I think so. There's no harm in putting it there, especially as there isn't much other news.

CodeCat10:12, 21 November 2014

I added the news myself today. Maybe for the first time ever.

Lo Ximiendo (talk)16:14, 21 November 2014

etymology citation

Hi, you recently deleted the triangular hebesphenorotunda citations page. The citation I gave illustrated the etymology in more detail than seemed appropriate on the primary definition, but is nonetheless interesting and relevant information. May I ask where else I should put that quote?

Origamidesigner (talk)20:10, 19 November 2014

The problem is that the citation does not contain the term "triangular hebesphenorotunda" at all. Citations are supposed to show the term as it is used.

CodeCat20:17, 19 November 2014

Ok, but it explains the etymological roots of the term in detail, attributed to the guy who coined the term. I'm not sure where else to put that information, maybe I should just integrate it into the etymology section?

Origamidesigner (talk)22:41, 19 November 2014

Or maybe that level of detail isn't necessary, and I should just forget about it?

Origamidesigner (talk)22:43, 19 November 2014

It's probably not necessary. Our etymologies mostly focus on how the word came to exist, and leave out anything else.

CodeCat23:24, 19 November 2014

{{temp|hu-suffix}} - Part 2

There is another issue with the new hu-suffix: In certain cases, we want to use the template with a blank first parameter, just to show the suffix and put the word into the correct suffix category. See dinamikus. The wording in these cases is "with -xx ending". This worked fine in the original version, but the new version adds a plus sign, "with + -xx ending", which does not make sense. I'd like to restore the original functionality.

Panda10 (talk)14:19, 15 November 2014

Why is Hungarian so unique that it needs all this extra stuff that other languages don't need?

CodeCat16:43, 15 November 2014

I can't speak for other languages because I don't know them well. I think it would be a simple change for you. Can you just add it?

Panda10 (talk)17:45, 15 November 2014

Why not remove the "with"?

CodeCat17:49, 15 November 2014

Because the + sign is superfluous in these cases. It is not a root + suffix.

Panda10 (talk)17:54, 15 November 2014

Can you give an example of where this is used? I don't quite understand it right now.

CodeCat17:55, 15 November 2014
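
The behaviour requested in this thread (no plus sign when the first parameter is blank) can be sketched in Python. This is not the template's actual code, which is not shown here; `join_etymology_parts` is a hypothetical name, and the sketch only illustrates joining the non-empty parts with " + ".

```python
def join_etymology_parts(root: str, suffix: str) -> str:
    """Join a root and a suffix for display, skipping a blank root.

    Hypothetical illustration: the " + " separator is only inserted
    between parts that are actually present.
    """
    parts = [part for part in (root.strip(), "-" + suffix) if part]
    return " + ".join(parts)


print(join_etymology_parts("root", "xx"))  # both parts present
print(join_etymology_parts("", "xx"))      # blank first parameter
```

With a blank root only the suffix is returned, so surrounding text such as "with ... ending" would read "with -xx ending" rather than "with + -xx ending".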

Reverting my edition

What is the reason for the reversion of my edit? What did I do wrong? Thank you.

Viethoc (talk)20:50, 16 November 2014

You are writing the definitions in Vietnamese. But this is an English dictionary, so the definitions should be written in English.

CodeCat20:51, 16 November 2014

How are you speeding up MewBot?

I notice that MewBot saves on the order of 20-30 changes per minute. I'm using a modified version of your blib for my bot, but every time it saves a page, it sleeps on the order of 5-8 secs, and I end up with only 5-7 changes per minute. This is using the function. Apparently you must be disabling throttling or using some API that doesn't throttle, or at least throttles less? I notice some scripts using page.put() or page.put_async(), and the code text = page.get(throttle=False), but I'm not sure how to directly control throttling, nor am I sure what the difference between page.put() and page.put_async() is. Any help would be greatly appreciated.

Benwing (talk)07:12, 13 November 2014

OK I think I figured it out, reducing the values of put_throttle, maxlag and maxthrottle in What settings do you use?

Benwing (talk)07:57, 13 November 2014

I only use put_throttle=1. Some people here think that this overloads the server, but I've been doing it a long time and I never noticed anything. Surely the servers are robust enough to handle one more request per second?

CodeCat15:30, 13 November 2014
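
To make the effect of a setting like put_throttle=1 concrete, here is a minimal, self-contained Python sketch of a save throttle. It is not the bot framework's implementation; the class name `PutThrottle` and its `clock`/`sleep` parameters are assumptions made for illustration (they are injectable so the behaviour can be checked without really waiting).

```python
import time


class PutThrottle:
    """Enforce a minimum delay (in seconds) between consecutive page saves.

    Illustrative sketch only: `delay` plays the role of a put_throttle value.
    """

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self._clock = clock
        self._sleep = sleep
        self._last_save = None  # time of the previous save, if any

    def wait(self):
        """Block until at least `delay` seconds have passed since the last save."""
        now = self._clock()
        if self._last_save is not None:
            remaining = self.delay - (now - self._last_save)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last_save = now


throttle = PutThrottle(delay=1)
for title in ["page one", "page two", "page three"]:
    throttle.wait()  # call before each save
    print("saving", title)
```

With a delay of 1 second the throttle alone allows at most about 60 saves per minute, whereas a 5-8 second sleep per save caps the rate at roughly 7-12 before request time is counted, which is in line with the rates reported in this thread.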

Would the affix template you suggested work for this word?

Edited by another user.
Last edit: 02:06, 11 November 2014

Recently I asked Chuck Entz for advice on how to set up the etymology for a word.

You suggested the following: An alternative is to use {{affix}}, a recently created template. It detects the type of affix (base, prefix, suffix, interfix) based on the parameters given, so there is no ambiguity. —CodeCat 01:59, 30 October 2014 (UTC)

Today I posted a word that has two suffixes. My language (Wauja) uses suffixes quite exuberantly, and some words have three in a row, or more. Here is the word: The etymology that you see there now is just a placeholder, to show you how I would like it to be displayed to the visitor. But it's just manually formatted, which is not what I want because it won't add the example to a list of entries for that suffix. Could the affix template you mentioned serve in a case like this?


Emi-Ireland (talk)01:56, 11 November 2014

I've now added the template to the entry. I've also added a few others. You can see what I changed by viewing the history of the entry and comparing my edit with the version before it.

CodeCat02:06, 11 November 2014

Wow! Thanks! Very interesting. Much more efficient. Several questions:

  1. Is ux|wau|etc. what I should use for all my entries? Is the way I have my entries now deprecated?
  2. And what's the purpose of the "1" in this? l|wau|yamukutai||child
  3. Is there a write-up explaining what t1, t3, and pos2 are all about? I can carefully reuse that code (and thank you very much) but I would like to really understand the what, why, and wherefore.
  4. What characters can I use to escape, so that I can use braces as ordinary characters in my questions and examples?
  5. Ah yes, is there any way I can search everything I've posted? Because sometimes I find a typo in an example I've used in more than one place and I want to make sure the error didn't get replicated elsewhere. Currently I keep a Word document version of my posts for that purpose. Very inefficient, of course.


Emi-Ireland (talk)03:13, 11 November 2014
  1. There are two templates, {{usex}} and {{ux}}, that do the same thing but they are used differently (the parameter names are different). It's preferred to use them but it's not strictly wrong to do it the old way.
  2. It's not a 1 but a lowercase L. It stands for "link" or perhaps "list" and is a shortcut to the template {{link}}.
  3. The documentation for the template is located on its page, Template:affix. Generally speaking you can find information about any template (anything between {{ and the first |) by putting "Template:" before the name and then going to that page.
  4. Single braces are ok to use. It's only double or triple ones that need special treatment. Is that what you wanted to do?
  5. In the top right, there's a "Contributions" link which takes you to all the edits you ever made on Wiktionary. There's no real search feature but it may be helpful anyway.
CodeCat10:59, 11 November 2014
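
The rule in point 3 (take whatever sits between {{ and the first |, and put "Template:" in front of it) can be written as a small Python sketch. The function name `template_doc_title` is hypothetical, and the sketch ignores special cases such as parser functions or templates in other namespaces.

```python
import re


def template_doc_title(call: str) -> str:
    """Return the page title where a template's documentation should live.

    Takes the text between "{{" and the first "|" (or the closing "}}")
    and prefixes it with "Template:".
    """
    match = re.match(r"\{\{\s*([^|{}]+?)\s*(?:\||\}\})", call)
    if match is None:
        raise ValueError("not a template call")
    return "Template:" + match.group(1)


print(template_doc_title("{{affix|wau|...}}"))
print(template_doc_title("{{ux}}"))
```

For the template discussed in this thread, this gives Template:affix, the page named in the reply above.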

Thanks very much. That advice is very helpful. If you happen to see any Wauja entries that are done improperly or inefficiently, I would be grateful to hear about it. Currently, I am working by myself, and I'm a newbie. Soon I will get the Wauja/Portuguese site going. Next summer, I will go back to Brazil to train young Wauja to add entries to the Wauja/Portuguese site, since the young people are bilingual in Wauja and Portuguese, and that dictionary will be very important to them. Many of the young Wauja are very capable at learning how to use unfamiliar digital tools. Though none of the three rainforest villages has a reliable internet connection, they have managed to make initial lexicons as MS Word documents, and the whole community is collaborating on standardizing the orthography. That's why I want to learn best practices myself, so I don't pass on bad habits. Thanks very much for the templates and other advice you have kindly provided.

Emi-Ireland (talk)15:58, 11 November 2014

I'd be happy to give feedback, but I won't be able to check every single entry let alone comment on it. I would recommend adding every Wauja page you create/edit to your watchlist (there is a checkbox for it above the button when you save your edits).

The watchlist is available through the link at the top right (next to the contributions link), and it automatically shows any edits that are done to any pages that are on the list. It works like a news feed or a favourites list.

If you check your watchlist regularly (once a day or so) you will be able to see when others have edited Wauja entries. That way you can see what they have changed (using the history/diff button) and that might help you improve things. I could of course notify you when I change one of your entries, but it's easier if you automatically get notified through your watchlist.

Of course you're still free to message me any time you want feedback or have a question, but this should help anyway.

CodeCat16:22, 11 November 2014

Sami/Finnic correspondences

Proto-Samic is absolutely full of Proto-Finnic (and later) loanwords. It is a good idea to not assert two similar words as cognates unless you know the further etymology, or exact references.

You can mostly check this thru the Álgu database… but you'll have to know the notation. So for example on the entry for *ājkē, you will note the type of arrow to Finnish is "!<". This does not mean "not loaned from", it means "possibly loaned from, yet in contradiction to another source".

In this particular case, given that

  1. in inherited Uralic vocabulary *a is reflected as Samic *uo
  2. the Finnic word has been explained as a Germanic or Indo-Iranian loan

loaning seems to be the better explanation.

Tropylium (talk)23:45, 8 November 2014

I used "cognate" precisely because I didn't want to commit to anything either way. It means a word is from the same common origin, but that doesn't mean Finnic can't be that origin. I realise this may not be how the term is commonly understood, but how else do you say "from the same source" if not "cognate"?

CodeCat23:51, 8 November 2014

I believe the term for "same origin as this word, possibly via descent from it" is "akin to". On the Finnic side this doesn't work either though, as descendants do not belong in the etymology section (much like how we do not claim that Latin hora is "cognate to" English hour.)

Tropylium (talk)23:59, 8 November 2014
