Wiktionary:Bots/Tasks

Definition from Wiktionary, the free dictionary
Jump to: navigation, search

Bot tasks[edit]

Formatting[edit]

formatting tasks which may be botted.

  1. Translation section cleanup and standardization
    1. Sections identified with {{rfc-trans}} need subdivision, with attention paid to numbers in parenthesis, all ambiguous entries plopped into a new ======Translations to be checked====== (TTBC) sub-section, with {{checktrans}}.
    2. TTBC sections with {{checktrans}} but no {{ttbc}}s need to have language names "wrapped" in {{ttbc}}. (Currently done semi-manually after each XML dump.)
    3. Sub-sections properly starting with "; gloss" should all have {{top}}/{{mid}}/{{bottom}} balancing the left and right columns to have about the same number of languages.
      1. Caution note: sub-sub-sub sections for language groupings should not split across columns
  2. Pronunciation sections need standardization
  3. Move content over top language header other than {{also}}, templates using {{character info}}, and {{selfref}} to under the first language header.

Import[edit]

text or other content which may be botted.

  1. automated image/audio uploads and inclusion (some tasks in the works)
  2. public domain dictionaries
  3. translations from elsewhere (extended discussions in protest: caution!)
  4. dictdefs from Wikipedia (not transwiki'ed, but glommed from introduction sections. This can possibly be semi-automated, never fully automated.)

Housework[edit]

housework tasks which may be botted

  1. archival (working, see Werdnabot)
  2. counter-vandalism (discuss)
  3. mass blocking (done: ask Connel for scripts)
  4. mass deletion (done: ask Connel for scripts)
  5. mass protection (unlikely to get support.)
  6. web-based category renaming tool (maybe an extension?)
  7. automated category adding tools
  8. inflection normalizing
    1. See Rod's bots (and comensurate controversy/blockings/wheel warring, etc.)
  9. Various normalizing tasks
    1. tasks need to be combined to minimize trivial edits
    2. User talk:Connel MacKenzie/Normalization of articles <-- scour for approved tasks (not all!)
  10. Wikisaurus bots (discuss)
    1. Generate entries from NS:0 entries with Synonyms or Antonyms
    2. Update existing entries from NS:0 entries with Synonyms or Antonyms
    3. Update NS:0 entries to point to WS rather than repeat lists
  11. WelcomeBot (do we want one?)
    IMO, no, and even if we did want users to be welcomed automatically, we could use mw:Extension:NewUserMessage instead. --Yair rand 01:44, 18 February 2010 (UTC)
  12. Special: WT:RFV archiving. If entry exists, and title is striken then move section to a new section of Talk:{PAGENAME}. If title is stricken, and entry does not exist, move to RFVA. BOTH CASES: only if last comment is one week old or older.
  13. Automatically create topic categories when there are pages that are in that category.
  14. Change non-templated translations to use the {{t}} template.
  15. Add {{also}} where necessary.
  16. Notify users when they have added {{rfv}}, {{rfd}}, etc to a page but have not added a section dealing with that page to the appropriate discussion room.

WT:WOTD stuff[edit]

  1. Rebuild alphabetic list of words used in an automatic fashion.
  2. Identify duplicates better, sooner, faster.
  3. Compile various feeder lists better.
  4. Auto-fill next month semi-automatically? No.

Wiktionary:Translations to be checked[edit]

Update the various sections automatically after each XML dump:

  1. Wiktionary:Translations to be checked#Languages nearly complete
  2. Wiktionary:Translations to be checked#Single entries
  3. Wiktionary:Translations to be checked#High-volume languages

WOTDs[edit]

  1. Updated alphabetized list is needed
  2. Auto tag entries of former WOTDs, saying "This word was WOTD on 'suchandsuch' date" as a footnote of some sort
  3. semi-protect the entries themselves the day they are WOTD? Unprotect them the day after.

Periodic Runs[edit]

  • replace.py
    -regex -ref:Template:Plural_of "\{\{plural of\|([^[]+?)}}" "{{plural of|[[\1]]}}"
    -regex -ref:Template:Past_of "\{\{past of\|([^[]+?)}}" "{{past of|[[\1]]}}"
    -regex -ref:Template:Alternative_spelling_of "\{\{alternative spelling of\|([^[]+?)}}" "{{alternative spelling of|[[\1]]}}"

copyvio bot[edit]

For existing English entries on en.wikt, take all text from "#" lines, and compare to default results from dictionary.com (etc.) and when 90%of words therein (or more) match, blank entry and replace with {{rndc}}. (Note: this 1) may not be possible, 2) needs significant discussion on WT:BP before starting.) --Connel MacKenzie 03:59, 16 October 2006 (UTC)

  1. Secondary task: accumulate statistics on such reversions. Auto-block any user with 50% or more copyvios, and autoblank all remaining entries from that contributor. --Connel MacKenzie 04:03, 16 October 2006 (UTC)
We might want to have a look at w:User:Wherebot for comparison or a jumping-off point. It sounds to me like it's doing a fairly similar function on Wikipedia, if not quite as specific. We might want to check etymologies, too. I've seen a few questionable ones go by. —Dvortygirl 04:07, 16 October 2006 (UTC)
  • Other comments from TheDave: weight the match criteria (for only exact matches) if the definition is only one short sentence, and only consists of one definition. **Also have a secondary weight, for tagging entries as "possibles", that need rewording. --Connel MacKenzie 04:36, 16 October 2006 (UTC)

rfvfailed[edit]

Hi guys. Apparently orphaned pages from {rfvfailed} are putting a spanner in the works. For a full discussion see DAVilla's talk page. A suggestion is to alphebetise all {rfvfailed}} entries, and to link them somehow to the relevant [WT:RFVA] page where the full discussion is archived. The link is the difficulty. If there are any other elegant solutions, they would be appreciated. Andrew massyn 19:36, 27 October 2006 (UTC) P.S. If it cant be done automatically, it can be done manuall, using a different system. Please let me know and i will institute the manual way. Andrew massyn 19:39, 3 November 2006 (UTC)

WT:DEL + Killbot.m seems to be handling these fairly well, now. --Connel MacKenzie 06:39, 2 March 2008 (UTC)

Conlang Pronunciations[edit]

Is there any chance that someone could make a bot that adds pronunciation to constructed language entries? It seems that most constructed languages follow pretty simple phonology rules, and it's a pretty big waste of time to add pronunciations to each of them manually... --Yair rand 19:54, 1 February 2010 (UTC)

Category creation[edit]

Bots could create some of the (easy-to-create) categories lying at Special:Wantedcategories --Rising Sun talk? 02:04, 18 February 2010 (UTC)

Links inside form-of templates[edit]

There should be a bot to remove links inside form-of templates. WT:Grease_pit_archive/2010/June#Links_inside_form-of_templates. --Yair rand (talk) 06:42, 4 November 2010 (UTC)

change position of template:beginning Mandarin and similar templates[edit]

Many Chinese words definition (example) begin with beginning Mandarin, intermediate or advanced Mandarin. At best it sounds vague, at worst it seems to mean that the word is not used by native speakers or that is some kind of ancient or esoteric language. It would be more logical to move that to a note section below the definition. Then the template could make it explicit what these categories mean. I have left a message at Appendix talk:HSK list of Mandarin words/Beginning Mandarin. I did not get any reply so I assume I can go ahead an request a bot to do it.--Zolo 06:04, 8 June 2011 (UTC)