Wiktionary:Information desk

Definition from Wiktionary, the free dictionary
(Redirected from Wiktionary:ID)
Jump to: navigation, search

Wiktionary > Discussion rooms > Information desk

You can search in the archives of Information desk:

Welcome to the Information desk of Wiktionary, a place where newcomers can ask questions about words and about Wiktionary, ask for help, or post miscellaneous ideas that don’t fit in any of the other rooms.

To start a new topic, clicking on the “+” tab, or click here: Start a new topic.

Sign your comments with four tildes (~~~~), code which produces your signature, followed by a UTC timestamp.

For past questions, see /Archives.


May 2017

Name for the groups of sockets/switches that are protected by one circuit breaker?[edit]

In a typical home, you have a breaker panel/consumer unit with a bunch of breakers or fuses, and each of them protects a certain subset of all the electrical wiring in the house. Thus, one switch might disable only the kitchen, another the upstairs lighting, etc. What is the name of these subdivisions (not the switches themselves)? Is it "circuit"? In Dutch, they are called groep, and it has its own Wikipedia page w:nl:Groep (elektrotechniek), but it has no counterparts in other languages. I can't add that sense unless I know what the English term is. —CodeCat 14:47, 1 May 2017 (UTC)

I think it's called a "circuit", but I feel that might be ambiguous. --WikiTiki89 18:35, 1 May 2017 (UTC)
I would call it a circuit. There may be a better, more technical name but I couldn't find one in the Wikipedia articles on circuit breakers or domestic power supply. SemperBlotto (talk) 05:32, 2 May 2017 (UTC)

I wrote a definition now. Is this ok? —CodeCat 16:29, 9 May 2017 (UTC)

Written variant a word that only occurs due to the preceding word[edit]

In Middle Dutch, you might encounter tes coninx hove (to the king's court), but in a text I have also come across tes sconinx hove. In that latter phrase, the s of the preceding word has "bled" into the next one, but there's no reason to suspect that the two versions differed in pronunciation at all. It's just an orthographic difference. Going by our mission to cover all words in all languages, there should be an entry for sconinx, but I don't really know what to put in it. It's not an alternative spelling, is it? —CodeCat 21:23, 7 May 2017 (UTC)

Would that be a rebracketing, like ME ekename/nekename? Equinox 21:25, 7 May 2017 (UTC)
You could call it a misspelling, especially if it occurs only in that one text. —Aɴɢʀ (talk) 21:31, 7 May 2017 (UTC)
It's not really a rebracketing because the s is still present on tes. Would it be called a rebracketing if grandma's oven were written as grandma's zoven (not retorical, I'm curious)? As for misspelling, you can't really call it that either as there was no spelling standard back then. And while I'm not sure if this particular word occurs in only in this one instance, the phenomenon as a whole (that is, of a preceding s showing up also on the next word) does occur more often I believe. —CodeCat 21:51, 7 May 2017 (UTC)
You could call it an s-mobile variant! Seriously, if you don't like {{misspelling of}}, I think {{alternative spelling of}} is your best bet. —Aɴɢʀ (talk) 09:14, 8 May 2017 (UTC)
I just went with misspelling. But it's also a legitimate contraction apparently so I added that too. —CodeCat 16:28, 9 May 2017 (UTC)
Assuming the misspelling is chiefly found in the context described above, I've added {{q|after a word ending in s}}. - -sche (discuss) 18:06, 10 May 2017 (UTC)

Given names only occurring in works of fiction[edit]

While translating Middle Dutch, I've come across the two names nobel and isengrijn. Neither of these are names that were given to people, they were made up for the story. However, it is of course possible that these names were used for people later in reference to the characters that originally bore them, in the same way that we might now call someone a Voldemort for example. But given names, they never were. How should these be handled by Wiktionary? Our policy seems to disallow names from fiction, so that would mean they don't get an entry. But if they were allowed, what should their definition be? Do we have anything like {{given name}} for fictional characters? —CodeCat 19:35, 9 May 2017 (UTC)

Wow, I checked a massive international database of birth records and indeed found no one with the name Isengrijn, despite how relatively often the character is mentioned in Google Books-digitized books; impressive that no-one has used the name.
Given how old the story is and how relatively often it has been mentioned (in both Dutch and English), one could ask if the name should be treated like the names of e.g. minor figures from Greek mythology or history, who I think we include.
Those are handled without {{given name}}, by just summarizing who the one person by that name was (like Pandarus). - -sche (discuss) 20:09, 9 May 2017 (UTC)
@-sche I've created isengrijn, what do you think? —CodeCat 16:36, 10 May 2017 (UTC)
The definition looks good, in line with how Pandarus, Procne, etc are handled. As a name of a specific figure, it'd be a proper noun though, wouldn't it? And I guess we follow the source manuscripts' lack of capitalization, or should we normalize the capitalization? - -sche (discuss) 18:04, 10 May 2017 (UTC)
I've entered all names as uncapitalised regular nouns for Middle Dutch so far. —CodeCat 18:08, 10 May 2017 (UTC)
Hmm, let's get others' thoughts on that; I thought based on previous discussions of proper nouns that names were among the clearest examples of proper nouns. - -sche (discuss) 23:04, 10 May 2017 (UTC)
Yeah, even if they're uncapitalized, I would think they should still be labelled proper nouns... Andrew Sheedy (talk) 05:08, 11 May 2017 (UTC)
@-sche, Andrew Sheedy: I agree with both of you. — I.S.M.E.T.A. 21:29, 11 May 2017 (UTC)

Technical issue: no quotations and the like[edit]

It seems to me that all quotations have disappeared off Wiktionary as of today. They still appear in the edit field, and I can even see them by requesting my browser to show the source page, but they are invisible when browsing normally. I also noticed I cannot unfold declension tables, so there may be even more problems I have not yet discovered.

I tried different browsers on different computers, and the problem persists, so it appears to be at Wiktionary's end. Anybody knows what's going on? MuDavid (talk) 06:56, 11 May 2017 (UTC)

It's a site-wide issue that's been reported in several fora. I don't know the cause or if a bug report has been filed. —Μετάknowledgediscuss/deeds 06:58, 11 May 2017 (UTC)
Let's consolidate discussion in Wiktionary:Grease pit/2017/May#Show-and-hide_templates_not_working_properly. - -sche (discuss) 08:01, 11 May 2017 (UTC)

Philosophy against personal preferences[edit]

Does anybody know a word for someone who believes that having personal preferences is somehow mediocre or wrong, and instead focuses on learning about and enjoying the entire world around them? PseudoSkull (talk) 13:08, 11 May 2017 (UTC)

@PseudoSkull: The first thing that comes to mind is cosmopolitanism as a kind of counter to tribalism. Not sure if that quite fits the bill. Or being a citizen of the world. Some form of syncretism possibly? Maybe these are a start. —Justin (koavf)TCM 21:30, 11 May 2017 (UTC)
antisubjective? Rather rare and technical, probably used to describe approaches rather than individuals. Equinox 12:55, 12 May 2017 (UTC)

technical issue: oversized arabic headwords[edit]

Hi, I had added some code to increase the size of arabic script in my special page (I am afraid I cannot find the original post). Yet, the headwords have started to be several times bigger than the rest of the script in the entries. I'd like them to be the exact size as the rest. --Backinstadiums (talk) 09:44, 12 May 2017 (UTC)

Maybe it was this: User:Backinstadiums/common.css. —Stephen (Talk) 13:36, 14 May 2017 (UTC)

Rollback rights[edit]

How do you get rollbacker rights here? Do you have to start a vote or whatnot like for admins? I can't find a clear policy page telling how this is done. PseudoSkull (talk) 03:47, 13 May 2017 (UTC)

Hmm! But see Wiktionary_talk:Whitelist#Rollbacker. Equinox 03:52, 13 May 2017 (UTC)

cs for ç in Old French?[edit]

In Middle Dutch, francsoys is attested referring to the French language. That this is a loan is obvious, but I'm curious about the digraph -cs-. Is this an early form of the letter ç that the word is now spelled with? What would its pronunciation have been at the time? —CodeCat 15:51, 17 May 2017 (UTC)

In Old French, ç was also spelled with the digraph cz, so maybe cs was just another less common variant. It was pronounced /ts/. --WikiTiki89 19:05, 17 May 2017 (UTC)
@User:CodeCat This would make an interesting entry. PseudoSkull (talk) 04:31, 18 May 2017 (UTC)

Words with prefix a-[edit]

There are a few notes on alpha privatives, but nothing that I could see about words like ASLEEP, AWAKE, ASHAMED, AWARE, ABROAD and others, and also possibly across and along. Also I think the existing entries for the a suffix and the alpha privatives need work. Anyone up for that? —This unsigned comment was added by JDShepherd (talkcontribs).

Help creating new entry[edit]

I'm new to wiktionary and came across a new Spanish word that isn't in wiktionary so I would like to add it but I'm not sure how to go about it. Can I just copy the definition from any other online dictionary? What kind of verification is needed that it is indeed correct? --Polyknot (talk) 23:26, 21 May 2017 (UTC)

@Polyknot: Excellent question. Unfortunately, you probably cannot just copy and paste a definition. Dictionaries are eligible for copyright and unless the holder of the copyright allows for copying, then we can't accept something copied and pasted. What is the term? Do you want to show me a link to it? —Justin (koavf)TCM 04:52, 22 May 2017 (UTC)
The word is pegote. A search for "pegote spanish" shows some links with the main meaning being a "sticky mess" which I believe can be used both literally and figuratively. --Polyknot (talk) 07:35, 22 May 2017 (UTC)
@Polyknow: Good, so you can add your own definition(s) in your own words. Since--as you pointed out--it can evidently refer to both a literal situation and a figurative one, then you will need a definition that includes both senses. (Or at least, it's better to have that--entries don't have to be perfect, as this is a wiki after all.) A lot of the entries on Wiktionary employ templates for linking, categorization, and adding structure. This is a big difference from Wikipedia or many of our other sister projects. I would recommend taking a look at the source of a few Spanish entries (e.g. árbol or pie) and see how they are formatted, then try to post your definition as an experiment. If you're uncomfortable putting it in the main namespace at "pegote", then you could put it at User:Polyknot/pegote as a start. Once you've done that, let me know by posting here and I can take a look. —Justin (koavf)TCM 08:33, 22 May 2017 (UTC)
Thanks for your help @Koavf:. I went ahead and created the page with the only definition I understand for now as well as the page for it's plural. I forgot about the namespace you mentioned though. What is a namespace? --Polyknot (talk) 13:17, 22 May 2017 (UTC)
@Polyknot: No problem. A "namespace" is a way of segregating out different types of material in Wiktionary (and our sister wikis). Certain types of content form the main dictionary itself and there are other related namespaces that make up the main content of the site (such as the Appendix namespace or Thesaurus namespace). Others are for discussion or user pages. For instance, this page is under the Wiktionary namespace which designates that whatever content you see is supposed to be talking about this dictionary and how to improve it rather than the content of the dictionary. See above where the title of the page is "Wiktionary:Information desk/2017/May". You can also look at Wikipedia's guidelines on namespaces here for more guidance: w:en:WP:NAMESPACE. This entry looks like a fine start so far. If you plan on doing more work with Spanish entries, I can provide a little help there, too as I know some Spanish and have edited a little bit of Spanish here. A useful resource is the DRAE and we link to it using {{R:DRAE}} but in this case, they don't have a definition for "pegote". Let me know if you need anything else and bienvenido. —Justin (koavf)TCM 15:57, 22 May 2017 (UTC)

Appendix:Snowclones/take this X and shove it[edit]

What does this phrase mean? ‘Keep this lousy thing to yourself’? — (((Romanophile))) (contributions) 19:56, 22 May 2017 (UTC)

"You have offered or given me something that I don't want. Instead of accepting it graciously, I want you to take it back and stick it inside of your anus." It shows disgust and is frequently used in the context of quitting a job that you hate: "take this job and shove it". (But it's not likely that many real human beings use this phrase when quitting a job--it's used in songs and films.) —Justin (koavf)TCM 20:22, 22 May 2017 (UTC)

Nominalization[edit]

Is there a template to use for "Nominalization" in etymologies? Look at Ich and Neues.--Bigbossfarin (talk) 07:50, 23 May 2017 (UTC)

Why fewer entries than WP?[edit]

Why is it that we have fewer mainspace pages than that place I dare not go back to; Wikipedia? Around 200,000 less pages to give a little insight. I understand we have less editors, but for one, it is so much easier and faster to create pages here, and for two, pages there tend to be significantly longer than here and tend to take up so much time and effort and space, and have a lot more individual content. For instance, compare the Wiktionary entry for homeowner association to the Wikipedia article of the same name. I will say, though, we are catching up with WP rather quickly it seems. There are so many more individual words (in all languages) than there are encyclopedic topics in just the English language I'm sure. PseudoSkull (talk) 02:51, 29 May 2017 (UTC)

Dictionaries have no need for entries on many encyclopedic subjects. For example, see en:w:Athletics at the 1956 Summer Olympics – Men's 3000 metres steeplechase; nobody's going to create a dictionary entry for athletics at the 1956 Summer Olympics – Men's 3000 metres steeplechase, or athletics at the 1956 Summer Olympics, or athletics at the Summer Olympics. Nyttend (talk) 03:39, 29 May 2017 (UTC)
Because Wikipedia is far more popular and well-known. Plus, most of us are far more interested in reading an encyclopedia than a dictionary. Altho both are reference works, it is much less likely that anyone will spend time really digging into a dictionary than click from topic to topic to topic at an encyclopedia. —Justin (koavf)TCM 05:48, 29 May 2017 (UTC)
  • Because most titles of Wikipedia articles would be SOP here. There are more things in the world than there are words. Pretty simple logic. Not sure what you're complaining about. --WikiTiki89 17:36, 29 May 2017 (UTC)

It's not really a contest. Dictionaries and encyclopedias have a more-or-less definable amount of entries they could have (e.g. w:en:User:Emijrp/All_Human_Knowledge). Projects like Wikidata and Wikisource are a little more difficult to put a number on unless you can really sharply define some parameters. Projects like Wikiversity or Wikinews are basically impossible to quantify by their nature. —Justin (koavf)TCM 20:13, 29 May 2017 (UTC)

This page is telling me that Wikipedia and other Wikimedia sites (like Wiktionary) will likely be used to enhance self-expanding AI? PseudoSkull (talk) 03:34, 1 June 2017 (UTC)
@PseudoSkull: Singularities aside, the page is an attempt to measure out how big Wikipedia should be. —Justin (koavf)TCM 04:46, 1 June 2017 (UTC)

dash[edit]

In the entry for dash, I'd like to change the Nigerian usage from "Nigeria" to "West Africa", since the same usage appears in Liberian English (see Some Terms from Liberian Speech, page C5), but I'm not sure whether it would be better to change the contents of {{lb}} to "Nigeria and Liberia", or "West Africa", or something else. I very rarely edit here, so I'm not sure where to look beyond the documentation for {{label}}, and that's not particularly helped me, unfortunately. Nyttend (talk) 03:37, 29 May 2017 (UTC)

If it's mainly used in Nigeria and Liberia, {{lb|en|Nigeria|and|Liberia}} would be the way to go (with vertical lines setting off the "and" so that each country is parsed as a separate label); If it's used broadly across West Africa, then {{lb|en|West Africa}} would work. I have edited Module:labels/data/regional so that CAT:Nigerian English and CAT:Liberian English, or CAT:West African English, will be added when the label is used. - -sche (discuss) 06:19, 29 May 2017 (UTC)

regular expressions[edit]

Hi, I'd like to be led to the page about searching itself, so that I can see how to use regex. Is it possible with any kind of script, e.g. Arabic? how can it be implemented for chinese characters? --Backinstadiums (talk) 17:33, 31 May 2017 (UTC)

@Backinstadiums: All documentation like that is at mw:. —Justin (koavf)TCM 18:01, 31 May 2017 (UTC)
@Koavf: could you specify where? I cannot find it --Backinstadiums (talk) 18:14, 31 May 2017 (UTC)
@Backinstadiums: All I can point to is the help documentation on searching. —Justin (koavf)TCM 18:21, 31 May 2017 (UTC)
@Koavf: this one's better https://www.mediawiki.org/wiki/Help:CirrusSearch --Backinstadiums (talk) 18:40, 31 May 2017 (UTC)

June 2017

Are these valid citations of Windows?[edit]

  • [1]: As a MAC girl, I lose sleep over why people use Windows.
  • [2]: There's a reason people use Windows. It just werks.
  • [3]: ... have either vandalized or enhanced (depending on who you are) an upcoming Apple Store with a Windows logo.
  • [4]: Windows users are generally regarded as either thieves, people who lost their keys, or just complete morons too stupid to use doors.
  • [5]: If Mac Users Care About The Environment More Than Windows Users?

--2001:DA8:201:3512:BCE6:D095:55F1:36DE 23:05, 3 June 2017 (UTC)

  • Looks reasonably good to me. Not all of us agreed to the deletion of a word that obviously exists. SemperBlotto (talk) 05:54, 4 June 2017 (UTC)
    • It seems Firefox (deleted previously) can also be cited:
  • [6]: I use Firefox and MacOS, so I'm not afraid of viruses,
  • [7]: But I've realized that many Meezers dont use Firefox
  • [8]: You have to use Firefox and Yahoo for the rest of your life.
  • [9]: And with the right accessories, they can do a spot-on Firefox impression.
  • [10]: They also have a Firefox.

--2001:DA8:201:3512:BCE6:D095:55F1:36DE 06:41, 4 June 2017 (UTC)

    • Also XP:
  • [11]: For those XP users, ever wanted to muted certain applications like vista can??
  • [12]: Avast! AntiVirus in XP VM under Fusion
  • [13]: Check it out, I found something that does what CDA does and works with XP!
  • [14]: you got to limit your game because people still use XP with slow processors
  • [15]: although I can't use it because I still run XP.

--2001:DA8:201:3512:BCE6:D095:55F1:36DE 07:09, 4 June 2017 (UTC)

  • Yes. Our mission statement is "all words in all languages". But not everybody agrees with it. SemperBlotto (talk) 07:27, 4 June 2017 (UTC)

West (of) England vs West Country[edit]

Is "West England"/"West of England" synonymous with "West Country"? In particular, do they both include Cornwall? If "West England" doesn't include Cornwall, then perhaps it shouldn't be listed as a synonym of "West Country" in Module:labels. - -sche (discuss) 21:18, 5 June 2017 (UTC)

Latin vowel length[edit]

hi guys! really appreciated the time and effort that you took for the latin entries, especially the use of macrons for vowel length! one thing though, is it possible to create a list of all of the latin words with macrons? because when I used Special:Export and added the category Latin lemmas, it does not have the macrons. any help will be appreciated, as it would make my experience with latin easier, especially when i have no internet. thank you all! —This unsigned comment was added by 203.111.89.251 (talk) at 10:11, 6 June 2017 (UTC).

That's a bit like asking for all the Latin words that contain the letter r. —Tamfang (talk) 08:18, 8 June 2017 (UTC)
I'm not certain, but I think you can find such information in the latest database dump (SQL dump, XML dump, or whatever it's called). You should ask -sche. I think he knows about it. —Stephen (Talk) 05:05, 13 June 2017 (UTC)
Macrons are not used in entry names (page titles), only in the text in the entry. Latin headwords (the line of text at the top of the part-of-speech section) that contain macrons could be put in a category, using Module:headword. However, the macrons wouldn't display on the category page, and I don't know how to get a complete list of entry names, much less a complete list where the list entries have macrons. But perhaps a bot could do it. @DTLHS has made quite a few lists with his bot before. — Eru·tuon 05:21, 15 June 2017 (UTC)
I would need a list of Latin headword templates and the parameters they could take. DTLHS (talk) 05:23, 15 June 2017 (UTC)
It looks like all the templates use either |1= or |head= for the displayed headword. (Additional headwords will appear in |2= or |head2=, |3= or |head3=, etc. Those should probably be ignored if they are not equal to the headword when macrons are removed.) See the list of headword templates below. {{head}} might be used too. — Eru·tuon 05:49, 15 June 2017 (UTC)
list of Latin headword templates
Template:la-adj-1&2
Template:la-adj-3rd-1E
Template:la-adj-3rd-2E
Template:la-adj-3rd-3E
Template:la-adj-comparative
Template:la-adj-form
Template:la-adj-superlative
Template:la-adv
Template:la-comp-form
Template:la-diacritical mark
Template:la-gerund
Template:la-gerund-form
Template:la-gerundive
Template:la-interj
Template:la-phrase
Template:la-suffix
Template:la-letter
Template:la-location
Template:la-noun
Template:la-noun-form
Template:la-proper noun-form
Template:la-proper noun
Template:la-num-1&2
Template:la-num-card
Template:la-num-form
Template:la-part-form
Template:la-future participle
Template:la-perfect participle
Template:la-present participle
Template:la-pronoun
Template:la-pronoun-form
Template:la-punctuation mark
Template:la-suffix-1&2
Template:la-suffix-3rd-2E
Template:la-suffix-adv
Template:la-suffix-form
Template:la-suffix-noun
Template:la-verb
Template:la-verb-form
Here you go: tab delimited, or json if you prefer. Here are the rules that I used: {'la-adj-1&2': [1, 'adjective'], 'la-adj-3rd-1E': [1, 'adjective'], 'la-adj-3rd-2E': [1, 'adjective'], 'la-adj-3rd-3E': [1, 'adjective'], 'la-adj-comparative': [1, 'adjective'], 'la-adj-superlative': [1, 'adjective'], 'la-adv': [1, 'adverb'], 'la-interj': [1, 'interjection'], 'la-phrase': ['head', 'phrase'], 'la-location': [1, 'proper noun'], 'la-noun': [1, 'noun'], 'la-proper noun': [1, 'proper noun'], 'la-num-1&2': [1, 'numeral'], 'la-verb': [1, 'verb']} DTLHS (talk) 18:11, 15 June 2017 (UTC)

bačičobi[edit]

What language has a word for 'right (turn)' that sounds like bačičobi? —Tamfang (talk) 16:59, 10 June 2017 (UTC)

Term black people use for white people in films[edit]

In German dubs of Hollywood films, the go-to slang term for black characters to refer to white people, whether threatening or affectionate, is Weißbrot (wheat bread). I can't find what they say in the originals. My assumption would be 'cracker' but that seems more harsh than it comes across in the dubs. Korn [kʰũːɘ̃n] (talk) 12:35, 12 June 2017 (UTC)

A few ideas: Wikisaurus:white person, as well as the synonyms in whitey --Barytonesis (talk) 12:42, 12 June 2017 (UTC)
Actually it might be a direct calque from white bread. --Barytonesis (talk) 12:49, 12 June 2017 (UTC)
I think honky is the term usually used in films and TV shows. —Aɴɢʀ (talk) 21:01, 12 June 2017 (UTC)

Hello. How can I get the entire list of words in a category?[edit]

For example, at the bottom of this page, it says there are 35,830 English verbs in total, but I can only see 200 of them at a time.

https://en.wiktionary.org/wiki/Category:English_verbs

How can I get the entire list of 35,830 verbs?

I am making a word list for learners of English and I'll credit Wiktionary in it. Thanks. Eopluo (talk) 10:17, 14 June 2017 (UTC)

@Eopluo: You can use w:en:WP:AWB to generate lists from categories. —Justin (koavf)TCM 15:22, 14 June 2017 (UTC)
Wow. I'll try that. You're a lifesaver. I was seriously planning to download all the words manually. Eopluo (talk) 04:14, 15 June 2017 (UTC)
@Eopluo: No problem. Let me know how else I can help. —Justin (koavf)TCM 04:19, 15 June 2017 (UTC)
AWB has a painful upper limit of pages to fetch tho. AFAIK it is 200. Dixtosa (talk) 04:57, 15 June 2017 (UTC)
@Dixtosa: It's 25,000. If the OP needs more, he will have to use some other tool or a database dump. —Justin (koavf)TCM 05:10, 15 June 2017 (UTC)
Yes I was wrong thank you for clarification.--Dixtosa (talk) 17:30, 16 June 2017 (UTC)

It isn't obvious how to add categories to an entry[edit]

I think I have looked in quite a few places in the Help section to see how to add a category to an entry, but there doesn't seem to be any instructions. I ended up doing the same thing I do in Wikipedia edits, i.e. putting something like [[Category:...]] at the very end (in this case [[Category:en:Animal foods]] at the end of the entry for swill), but I'm not sure how the general newcomer will know what to do, and especially little details like including "en:" for English (and perhaps I am wrong? Shouln't I be using some curly-bracket structure? Yet it seemed to work).

So I think some more documentation is needed somewhere, and maybe with some help about using categories... e.g. answer questions like:

  • where to put the categories, and
  • what categories are added automatically;
  • is "Animal foods" also for birdseed - some ways of thinking of "Animal" include birds, some don't... is there a place to look at what people think a category should be for?
  • Should something also be added to some Wikisaurus entry when something is added to a set? Is there a fast way to do both at once (there are a lot of gaps, e.g. "Animal foods" only has a handful of entries at the moment).
  • What if there are several words for exactly the same thing (e.g. alfalfa and lucerne) or even alternative ways of writing the same thing (e.g. with or without a hyphen or space)... should they all be listed for a given category (I presume not, but where do I find guidelines?)?

Thanks, Maitchy (talk) 02:13, 15 June 2017 (UTC)

1. yes, I think most people would consider animals to be birds. 2. I wouldn't bother with Wikisaurus pages at all, the whole thing is not well thought out. 3. If there are different words for the same thing they can both be in the same category. Alternative spellings however (spaces / hyphens), just categorize the main form. DTLHS (talk) 02:22, 15 June 2017 (UTC)
@Maitchy: I agree that sometimes this project can be a little confusing, especially in regards to categorization. E.g. many Portuguese lemmas are categorized by templates. It's probably preferable to add categories to the part of a definition that matches that language. I.e. If you have a category like [[Category:en:Birds]] put that code near the English definition, immediately before any horizontal lines which separate languages. Do you have any more specific questions? —Justin (koavf)TCM 03:57, 15 June 2017 (UTC)

List of ideophones by language?[edit]

I noticed today that Wikipedia used to have a 'List of ideophones by Language', which was deleted in January 2017 - in part on the grounds that it had a list of words which would be better off here on Wiktionary. I agree with the sentiment, but can't find anything like this list on WT. Some of the words that were there have entries on here in various languages, but most of the handful that I checked seem not to.

I'm not very wiki-savvy, and almost entirely new to Wiktionary, so I don't know how/if one can access deleted WP pages. But, at least some version of the page is cloned on this terrible website. As far as I understand the spirit of the criteria for inclusion, most of those words ought to be included. Does anyone with more knowhow want to take this on?

I'm putting this here rather than in requests partly because it's not a request for a specific word-level entry, and partly because I'm a noobie. —This unsigned comment was added by WmGB (talkcontribs).

You can ask the admin who deleted in on en.wp (or any other admin for that matter) for a copy of the file exported as XML and then it can be imported here. —Justin (koavf)TCM 19:15, 22 June 2017 (UTC)
The List of ideophones by language was deleted by Timotheus Canens. He can easily access the deleted page if you want to save its contents. Note that entries in any given language must be in the proper script, so there will probably be a bit of work for you to convert these to the correct script and spelling. —Stephen (Talk) 07:08, 23 June 2017 (UTC)
@WmGB: I'm an admin on en.wiki so I took the liberty of copying the contents of the page prior to its deletion to my userpage: User:BigDom/List of ideophones by language. BigDom 10:31, 27 June 2017 (UTC)

Wiktionary and Wikipedia dump files understanding for parsing and indexing[edit]

Discussion moved from Wiktionary talk:Information desk/Archive 2014/January-June.

Wiktionary and Wikipedia dump files detailed understanding for parsing and indexing as it is not clear about which dump file to download to parse and index. For Wiktionary I have downloaded enwiktionary-20170501-pages-articles.xml mentioned xml dump but its fetching only 23742 english words and that too its not containing basic words. So can you help me out with the complete dump thing so that i will be able to make a dictionary of almost 1 lakh words. Same thing with Wikipedia able to fetch only ~851000 articles which contains Infobox but i guess its too small for a Wikipedia thing downloaded file is enwiki-20170601-pages-articles.xml. So awaiting your reply Thanks Himanshu.k (talk) 16:05, 27 June 2017 (UTC)

@Himanshu.k pages-meta-current.xml and pages-articles.xml work well for parsing. Maybe your parser is crashing before it finishes reading the whole file. — Ungoliant (falai) 16:15, 27 June 2017 (UTC)