Wiktionary:Bots/Tasks

From Wiktionary, the free dictionary
Jump to navigation Jump to search

Bot tasks[edit]

Formatting[edit]

formatting tasks which may be botted.

  1. Translation section cleanup and standardization
    1. Sections identified with {{rfc-trans}} need subdivision, with attention paid to numbers in parenthesis, all ambiguous entries plopped into a new ======Translations to be checked====== (TTBC) sub-section, with {{checktrans}}.
    2. TTBC sections with {{checktrans}} but no {{ttbc}}s need to have language names "wrapped" in {{ttbc}}. (Currently done semi-manually after each XML dump.)
    3. Sub-sections properly starting with "; gloss" should all have {{top}}/{{mid}}/{{bottom}} balancing the left and right columns to have about the same number of languages.
      1. Caution note: sub-sub-sub sections for language groupings should not split across columns
  2. Pronunciation sections need standardization
  3. See also: WT:NORM

Import[edit]

text or other content which may be botted.

  1. automated image/audio uploads and inclusion (some tasks in the works)
  2. public domain dictionaries
  3. translations from elsewhere (extended discussions in protest: caution!)
  4. dictdefs from Wikipedia (not transwiki'ed, but glommed from introduction sections. This can possibly be semi-automated, never fully automated.)

Housework[edit]

housework tasks which may be botted

  1. archival (working, see Werdnabot)
  2. counter-vandalism (discuss)
  3. mass blocking (done: ask Connel for scripts)
  4. mass deletion (done: ask Connel for scripts)
  5. mass protection (unlikely to get support.)
  6. web-based category renaming tool (maybe an extension?)
  7. automated category adding tools
  8. inflection normalizing
    1. See Rod's bots (and comensurate controversy/blockings/wheel warring, etc.)
  9. Various normalizing tasks
    1. tasks need to be combined to minimize trivial edits
    2. User talk:Connel MacKenzie/Normalization of articles <-- scour for approved tasks (not all!)
  10. Wikisaurus bots (discuss)
    1. Generate entries from NS:0 entries with Synonyms or Antonyms
    2. Update existing entries from NS:0 entries with Synonyms or Antonyms
    3. Update NS:0 entries to point to WS rather than repeat lists
  11. WelcomeBot (do we want one?)
    IMO, no, and even if we did want users to be welcomed automatically, we could use mw:Extension:NewUserMessage instead. --Yair rand 01:44, 18 February 2010 (UTC)[reply]
  12. Special: WT:RFV archiving. If entry exists, and title is striken then move section to a new section of Talk:{PAGENAME}. If title is stricken, and entry does not exist, move to RFVA. BOTH CASES: only if last comment is one week old or older.
  13. Automatically create topic categories when there are pages that are in that category.
  14. Change non-templated translations to use the {{t}} template.
  15. Add {{also}} where necessary.
  16. Notify users when they have added {{rfv}}, {{rfd}}, etc to a page but have not added a section dealing with that page to the appropriate discussion room.

WT:WOTD stuff[edit]

  1. Rebuild alphabetic list of words used in an automatic fashion.
  2. Identify duplicates better, sooner, faster.
  3. Compile various feeder lists better.
  4. Auto-fill next month semi-automatically? No.

Update the various sections automatically after each XML dump:

  1. Wiktionary:Translations to be checked#Languages nearly complete
  2. Wiktionary:Translations to be checked#Single entries
  3. Wiktionary:Translations to be checked#High-volume languages

WOTDs[edit]

  1. Updated alphabetized list is needed
  2. Auto tag entries of former WOTDs, saying "This word was WOTD on 'suchandsuch' date" as a footnote of some sort
  3. semi-protect the entries themselves the day they are WOTD? Unprotect them the day after.

Periodic Runs[edit]

  • replace.py
    -regex -ref:Template:Plural_of "\{\{plural of\|([^[]+?)}}" "{{plural of|[[\1]]}}"
    -regex -ref:Template:Past_of "\{\{past of\|([^[]+?)}}" "{{past of|[[\1]]}}"
    -regex -ref:Template:Alternative_spelling_of "\{\{alternative spelling of\|([^[]+?)}}" "{{alternative spelling of|[[\1]]}}"
No longer relevant/applicable. Benwing2 (talk) 03:48, 27 September 2020 (UTC)[reply]

copyvio bot[edit]

For existing English entries on en.wikt, take all text from "#" lines, and compare to default results from dictionary.com (etc.) and when 90%of words therein (or more) match, blank entry and replace with {{copyvio suspected}}. (Note: this 1) may not be possible, 2) needs significant discussion on WT:BP before starting.) --Connel MacKenzie 03:59, 16 October 2006 (UTC)[reply]

  1. Secondary task: accumulate statistics on such reversions. Auto-block any user with 50% or more copyvios, and autoblank all remaining entries from that contributor. --Connel MacKenzie 04:03, 16 October 2006 (UTC)[reply]
We might want to have a look at w:User:Wherebot for comparison or a jumping-off point. It sounds to me like it's doing a fairly similar function on Wikipedia, if not quite as specific. We might want to check etymologies, too. I've seen a few questionable ones go by. —Dvortygirl 04:07, 16 October 2006 (UTC)[reply]
  • Other comments from TheDave: weight the match criteria (for only exact matches) if the definition is only one short sentence, and only consists of one definition. **Also have a secondary weight, for tagging entries as "possibles", that need rewording. --Connel MacKenzie 04:36, 16 October 2006 (UTC)[reply]

rfvfailed[edit]

Hi guys. Apparently orphaned pages from {rfvfailed} are putting a spanner in the works. For a full discussion see DAVilla's talk page. A suggestion is to alphebetise all {rfvfailed}} entries, and to link them somehow to the relevant [WT:RFVA] page where the full discussion is archived. The link is the difficulty. If there are any other elegant solutions, they would be appreciated. Andrew massyn 19:36, 27 October 2006 (UTC) P.S. If it cant be done automatically, it can be done manuall, using a different system. Please let me know and i will institute the manual way. Andrew massyn 19:39, 3 November 2006 (UTC)[reply]

WT:DEL + Killbot.m seems to be handling these fairly well, now. --Connel MacKenzie 06:39, 2 March 2008 (UTC)[reply]

Conlang Pronunciations[edit]

Is there any chance that someone could make a bot that adds pronunciation to constructed language entries? It seems that most constructed languages follow pretty simple phonology rules, and it's a pretty big waste of time to add pronunciations to each of them manually... --Yair rand 19:54, 1 February 2010 (UTC)[reply]

Category creation[edit]

Bots could create some of the (easy-to-create) categories lying at Special:Wantedcategories --Rising Sun talk? 02:04, 18 February 2010 (UTC)[reply]

I am doing this every 3 days or so. Benwing2 (talk) 03:48, 27 September 2020 (UTC)[reply]

Links inside form-of templates[edit]

There should be a bot to remove links inside form-of templates. WT:Grease_pit_archive/2010/June#Links_inside_form-of_templates. --Yair rand (talk) 06:42, 4 November 2010 (UTC)[reply]

change position of template:beginning Mandarin and similar templates[edit]

Many Chinese words definition (example) begin with beginning Mandarin, intermediate or advanced Mandarin. At best it sounds vague, at worst it seems to mean that the word is not used by native speakers or that is some kind of ancient or esoteric language. It would be more logical to move that to a note section below the definition. Then the template could make it explicit what these categories mean. I have left a message at Appendix talk:HSK list of Mandarin words/Beginning Mandarin. I did not get any reply so I assume I can go ahead an request a bot to do it.--Zolo 06:04, 8 June 2011 (UTC)[reply]

Swedish proper names[edit]

Can some bot do the following:

Thank you!Jonteemil (talk) 18:24, 16 July 2017 (UTC)[reply]

@Robbie SWE does this request make sense? - TheDaveRoss 13:52, 5 October 2017 (UTC)[reply]
Hmm, I guess it makes sense even though I personally don't like the idea of adding gender to given names and surnames. Swedish Wiktionary adds gender so it probably is ok. --Robbie SWE (talk) 17:23, 5 October 2017 (UTC)[reply]
I think I have gotten all of these done. - TheDaveRoss 18:40, 10 October 2017 (UTC)[reply]
@TheDaveRoss Hello! Thanks for your edits two years ago :). Do you mind checking the categories once again. There are, you see, still 581 entries that use {{head|proper noun}}. I've added a few categories as well to get as many as possible to use {{sv-proper noun}}.Jonteemil (talk) 07:17, 14 November 2019 (UTC)[reply]

German adjectives[edit]

Can someone change all entries in Category:German comparative adjectives that have this syntax:

===Adjective===

{{head|de|adjective comparative form}}

# {{comparative of|de|{{#invoke:string|sub|s={{PAGENAME}}|i=1|j=-3}}}}

to this

===Adjective===

{{head|de|adjective comparative form}}

# {{comparative of|de|{{subst:#invoke:string|sub|s={{subst:PAGENAME}}|i=1|j=-3}}}}

# {{inflection of|de|{{subst:#invoke:string|sub|s={{subst:PAGENAME}}|i=1|j=-3}}||str//mix|nom|m|s|;|str|gen//dat|f|s|;|str|gen|p}}

[[Category:German adjective forms]] ?

The irregulars, such as größer, don’t fit the syntax and should therefor not be changed. The reason to this change is because adjective+er isn't only comparative but also positive in some cases.Jonteemil (talk) 06:49, 14 November 2019 (UTC)[reply]

Are there actually uses of {{#invoke:...}} in entries? They should be wrapped in a template. Also, these templates are ridiculous, seems like there should be five good templates instead of one arcane one. - TheDaveRoss 13:29, 11 December 2019 (UTC)[reply]
@TheDaveRoss: No {{subst:#invoke:string|sub|s={{subst:PAGENAME}}|i=1|j=-3}} is just how you write {PAGENAME} minus the last letter. So for this page it becomes ”Bots/Tas”. I agree that the template is very weird indeed.Jonteemil (talk) 23:03, 12 December 2019 (UTC)[reply]
I did something more comprehensive than this, and changed the complex {{inflection of}} invocations to {{de-adj form of}}, which automatically generates the same results. Benwing2 (talk) 03:50, 27 September 2020 (UTC)[reply]

German adjectives 2[edit]

Can someone change all entries in Category:German adjective forms that have this syntax:

===Adjective===

{{head|de|adjective form}}

# {{inflection of|de|/positive form/||str|gen|m//n|s|supd|;|wk//mix|gen//dat|all-gender|s|supd|;|str//wk//mix|acc|m|s|supd|;|str|dat|p|supd|;|wk//mix|all-case|p|supd}}

or this one

===Adjective===

{{head|de|adjective form}}

# {{superlative of|de|/positive form/}}

and all entries in Category:German superlative adjectives that have this syntax:

===Adjective===

{{head|de|adjective superlative form}}

# {{superlative of|de|/positive form/}}

to

===Adjective===

{{head|de|adjective superlative form}}

# {{superlative of|de|/positive form/}}

# {{inflection of|de|/positive form/||str|gen|m//n|s|supd|;|wk//mix|gen//dat|all-gender|s|supd|;|str//wk//mix|acc|m|s|supd|;|str|dat|p|supd|;|wk//mix|all-case|p|supd}}

[[Category:German adjective forms]] ?

This applies to all adjectives, including the irregular ones.Jonteemil (talk) 23:46, 9 December 2019 (UTC)[reply]

@Jonteemil I did something more comprehensive than this, and changed the complex {{inflection of}} invocations to {{de-adj form of}}, which automatically generates the same results. Benwing2 (talk) 03:51, 27 September 2020 (UTC)[reply]
@Benwing2: Okay, perfect. I don't know how I feel reg. the double adjective headings though. How did you think there?Jonteemil (talk) 17:32, 27 September 2020 (UTC)[reply]
@Jonteemil They are different headword types. One is a sort of lemma, i.e. comparative or superlative adjective (even though it's technically classified as a non-lemma form), while the other is a form of a lemma. In such cases it's normal to use different headings. Benwing2 (talk) 17:35, 27 September 2020 (UTC)[reply]

Four-letter abbreviations[edit]

Possibly, a bot could add all entries that are four characters long and already in Category:English_abbreviations to Category:English_four-letter_abbreviations. The four-letter abbreviations category has been nominated for deletion but the only reason given focused on the fact the category had not been populated. Thank you. John Cross (talk) 07:30, 9 April 2020 (UTC) This should really say containing exactly four letters. John Cross (talk) 13:17, 3 June 2022 (UTC)[reply]

Hyphenated terms[edit]

I expect a bot could fully populate Category:English_terms_spelled_with_-. There may be better ways to achieve the same result. This category has been nominated for deletion. John Cross (talk) 07:35, 9 April 2020 (UTC)[reply]

Autocat to Auto cat[edit]

On a large number of category pages {{autocat}} is used when it should be {{auto cat}}. Not sure if this really matters. John Cross (talk) 11:58, 3 June 2022 (UTC)[reply]

This doesn't matter at all, and would be a waste of time to change. Theknightwho (talk) 21:07, 1 July 2022 (UTC)[reply]