User talk:Visviva

Definition from Wiktionary, the free dictionary
Jump to: navigation, search

User talk:Visviva/archive/2006-2007

User talk:Visviva/archive/2008

User talk:Visviva/archive/2009


I do not want to come across as contumelious but please consider casting your vote for the tile logo as—besides using English—the book logo has a clear directionality of horizontal left-to-right, starkly contrasting with Arabic and Chinese, two of the six official UN languages. As such, the tile logo is the only translingual choice left and it was also elected in m:Wiktionary/logo/archive-vote-4. Warmest Regards, :)--thecurran Speak your mind my past 03:16, 2 January 2010 (UTC)

Happy New Year[edit]

Well, well. Look who's back in town. Good to see you editing. Welcome back. -- WikiPedant 03:53, 2 January 2010 (UTC)

Thanks, it's good to be back. -- Visviva 14:49, 16 January 2010 (UTC)


Are you sure that this word is suffixed with side and not a compound word? Sometimes, these tiny compound words can screw some people up ;). Cheers, Razorflame 05:35, 16 January 2010 (UTC)

Not really, but I would imagine that the issue is the same as the other occupants of Category:English words suffixed with -side. I lean towards suffix (or technically, suffixal combining form) for these because there seems to be no common sense of side that satisfies. For example, "on the side of the shore" does not quite suit the meaning of shoreside, at least in modern use. "On the side of the lake" also suggests something rather more specific than lakeside. -- Visviva 14:56, 16 January 2010 (UTC)

Wiktionary:Votes/pl-2009-12/Treatment of toneless pinyin syllables[edit]

I now have found sufficient resources in print to be confident that any toneless pinyin syllable can be verified to exist, in accordance with the CFI. Here are some example works, [1], [2], [3], [4], [5], and toneless pinyin quotes:

  • 2007, Gunter R. Neeb, Blood Stasis: China's Classical Concept in Modern Medicine, p. 165:
    Prescription Chuan Xiong (Ligusticum) 15 g, Ge Gen (Pueraria) 20 g, Dan Shen (Salvia Miltiorrhiza) 30 g, Wei Ling Xian (Clematis) 20 g, Hong Hua (Carthamus) 10 g, Luo Shi Teng (Trachelospermum) 15 g, Tu Si Zi (Cuscuta) 20 g, Jin Ying Zi (Rosa Laevigata) 15 g, Chong Wei Zi (Semen Leonuri Heterophylli) 30 g, Yi Yi Ren (Coix) 10 g, Niu Xi (Achyranthis Bidentata) 15 g, Zhen Zhu Mu (Concha Margaritaferae) 30 g, to be taken as a decoction, Wu Gong Fen (Scolopendra Subspinipes Pulvis) 1 g, and Quan Xie Fen (Buthus Pulvis) 1 g to be taken as powder.
  • 2007, Gunter R. Neeb, Blood Stasis: China's Classical Concept in Modern Medicine, p. 260:
    This symptom was first described by Zhang Zhong-Jing in the 'Shang Han Za Bing Lun' (today's 'Jin Kui Yao Lüe') and treated with Ban Xia Hou Po Tang (Pinellia and Magnolia Decoction).
  • 2007, Gunter R. Neeb, Blood Stasis: China's Classical Concept in Modern Medicine, p. 271:
    Classical categorization of prescriptions into strong prescriptions (Da Fang), mild prescriptions (Xiao Fang), quick prescriptions (Qi Fang), slow-working prescriptions (Huan Fang), even-numbered prescriptions (Qi Fang), odd-numbered prescriptions (Ou Fang) and compound prescriptions (Fu Fang) that combine two or more base formulas.
  • 2007, Gunter R. Neeb, Blood Stasis: China's Classical Concept in Modern Medicine, p. 273:
    Bian Zheng Lun Zhi is derived from the holistic reflection and analysis of the four diagnostic methods (Si Zhen) and eight principles (Ba Gang)...
  • 2004, Michael Friedrich, Thomas O. Höllmann, Handschriften der Yao, p. 353:
    Yi lun zhu chu lai qing fa. Xian nian zhuan shi ming yu qing ge dao zheng shang qing tang dao ming tai qing li dao de ~ Xia jiang ~
    Zhuan tui shen xia lai nan chen si di zhu quan fen di gong shi Hang lu fen li ye.
    Yu huang zhen cheng zhi fa. Xian kou shi jiang shen cun bing ma.
    Yuan shi qu tie suo fu xie ren. Si ji jin xing zhong yi qian diao xie bian chu cheng le ji ye.
    Zao wan shen dou fei zhang jin si zhang fa ji ye. Ji xiang zhuan qu yue fu shi tai qing gong.
    Ji shi shuai zhen jun tong gou qu xiao xiong xing xing ji hao ye.
  • 2003, Bod-ljoṅs sman rtsis khaṅ. ʼJu byed tshan khag, Zang yi xiao hua xi tong ji bing zhen duan ji liao xiao ping ding biao zhun, cover.
  • 1997, Gail Hershatter, Dangerous Pleasures: prostitution and modernity in twentieth-century Shanghai, p. 406:
    Chang zhi xuan ju shi
    xie xie zheng zheng ye yaun zhi
    yi yang zipei yo udeng chai
    zong dai rong hua zhong mian su
    Bie cau jian shang ren chao chi
    kong qun jun zu xuan gao jia
    jue dai e mei ya zhong chi
    dan xi dong li feng gu leng
    he ren ken cai wai cai shi.
  • 1981, Laurence J. C. Ma, Edward W. Hanten, Urban Development in Modern China, p. 89:
    Guo wu yuan guan yu cheng xiang hua fen biao zhun de gui ding (Resolutions of the National Council on the Standards of Demarcation of Urban and Rural Places), Tong ji gong zuo tong xun, No. 12, 1955, p. 4.
    Nan lushi ye diao cha tuan bao gao shu (A Survey Report on Industries in Southwestern Guangdong), np, Guangdong sheng jian she ting, 1933.
    Hainan dao zhi (A Gazetteer of Hainan Island), n.p., n.d.
  • 1955, Zhenlin Chen, Ren ti xiao hua xi ji bing X xian zhen duan xue tu pu: Illustrations of X-ray diagnosis of diseases of the digestive system, cover.

There are many more where these came from. However, even from just this sample, I think it is no longer tenable to doubt that toneless pinyin, generally, is in clearly widespread use. bd2412 T 02:38, 2 February 2010 (UTC)

Bug with RC by language[edit]

Hi. I would like to report a bug. The RC for Armenian fails to record some changes. For example recently it missed this and this. Can you fix it when you're back? --Vahagn Petrosyan 20:54, 20 February 2010 (UTC)

"Add" words not working?[edit]

e.g. User:Visviva/NYT_20070221 — when I click an Add link, the resulting text in the new entry is full of plus signs instead of spaces. Is there an encoding issue? Equinox

I assume this was fixed ages ago, because later pages work fine. Equinox 18:59, 23 June 2010 (UTC)
It was an issue with the JS; it should work on all pages now. (Something got a little messed up when it was switched to MediaWiki space, I think). -- Visviva 19:03, 23 June 2010 (UTC)

RC by language[edit]

In order to make use of your phenomenal tool, I have created WT:RC and added a link to it in the side-bar. If there are problems with this please let me know. Conrad.Irwin 00:03, 11 March 2010 (UTC)

RC by language stopped again[edit]

Hi Visviva, the RC by language worked for 4 days then it stopped working again on 3/28. Would you take a look when you have a chance? Thanks. --Panda10 22:12, 5 April 2010 (UTC)

Thank you![edit]

You are now officially my favourite Visviva - both for fixing and for providing recent-changes by language so as I can stalk people who might know something :). Thanks! Conrad.Irwin 22:57, 19 May 2010 (UTC)

Wiktionary:Votes/2010-04/Voting policy[edit]

I urge you to vote. (I don't know which way you'll vote, but I want more voices, especially English Wiktionarians' voices, heard in this vote.) If you've voted already, or stated that you won't, and I missed it, I apologize.​—msh210 17:00, 21 May 2010 (UTC)



Can I ask you to create this entry, please? --Anatoli 00:00, 25 May 2010 (UTC)

Thanks. :) --Anatoli 05:30, 25 May 2010 (UTC)

Pumped up watchlists[edit]

So Conrad and I were talking about how cool it'd be to have a rule based watchlist. So, if a person wanted to be fed all changes to French entries/sections, Italian, and etymologies, they could get a page which had all that. Do you think that'd be feasible to implement on fraktionary? Is there some RC-by-language code we could look at and start monkeying with? Thanks. -Atelaes λάλει ἐμοί 16:56, 5 June 2010 (UTC)

Although I learned a lot from writing the RC script, the code is pretty much unsalvageable. The analysis and presentation are all slopped together, and the moving parts are spread out around about 10 different files (that's how that nasty little worm got by me). I could post it for amusement, but basically it needs to be rebuilt from scratch to be scalable.
On that note, the cycle() function in User:Visviva/Python/ has been running OK for me for the past hour or so. I think it would provide the kind of functionality you're looking for. However, it has no front end yet, just dumps a list of the affected sections for each edit to the shell. Taking that data and generating the HTML and RSS a la the current RC-by-language pages should be straightforward but time-consuming.
My goal with is to have it do a full logical mapping of each Wiktionary entry, which could then be interpreted into any of the various dictionary interchange formats out there, or mashed up with data from other sources like WordNet. Ways to go yet. -- Visviva 01:27, 6 June 2010 (UTC)

Wiktionary:Grease pit#What happened to my preferences?[edit]

Just making sure you had seen this, as most of it doesn't have to do with preferences anymore. -Atelaes λάλει ἐμοί 01:06, 10 June 2010 (UTC)

User:Visviva/Medical/By links/Y[edit]

This page is done. Delete the page? I've deleted some entries in other pages that were blue links, but I'm wondering now if we should keep these pages as some kind of record. What is your preference? bd2412 T 14:42, 21 June 2010 (UTC)

Delete if you feel like, but it'll probably be recreated with some fresh red links whenever I do the next pass. (Alternatively, maybe the next pass should go onto subpages of the Medicine project?) -- Visviva 19:06, 23 June 2010 (UTC)
Next pass? What else is there to get? Having these as subpages of the Medicine project sounds like a very good idea, though. bd2412 T 20:01, 23 June 2010 (UTC)


FYI, I have sent one sense of "compositional" to RFV, as I cannot find quotations. Maybe you would like to add some quoations? --Dan Polansky 12:26, 24 June 2010 (UTC)

character info[edit]

Would it be easy for you to automatically add {{character info}}-type templates to the character entries that exist and don't yet have one? If so, I can try and setup the remaining block templates that haven't been made. --Bequw τ 23:08, 1 July 2010 (UTC)

Foreign verb definitions[edit]

Thank you for your clarification in Wiktionary:Style guide#Types of definitions, as it has been very helpful to me. I'm getting anxious about the style of definitions for Finnish entries. I've been seeing a lot of long definitions that could be simple glosses without losing much information. There are also simple glosses followed by a long definitions in parenthesis in the same line. For example:

osoittaa — To show (to indicate (a fact) to be true; to demonstrate).

One reason that simple gloss translations are preferable for foreign words is that they are more machine readable. They can easily be parsed and indexed into bilingual dictionaries. For languages like Finnish, Wiktionary is the most complete source of information available, so simple glosses make Wiktionary much more exploitable and accessible.

So I have three questions.

  1. Should to be capitalized for simple gloss translations of verbs?
  2. If a gloss is followed by a clarifying statement, should I leave it on the same line with a capitalized To and a period? Which leads to the next question,
  3. When organizing by senses, does each definition have to present a different shade of meaning, or can it merely be a restatement of the definition above it? More specifically, can a sentence fragment definition just be a clarification of the gloss above it?

heyzeuss 12:42, 19 August 2010 (UTC)

Wiktionary:Deletion requests#Wikiquote[edit]

Hello. Since you participated in the deletion discussion above, I thought I might like to hear some input from you regarding this one. Thanks. TeleComNasSprVen 12:32, 26 September 2010 (UTC)

Template transclusion script[edit]

Hi Visviva, I'm looking for a way to translcude/substitute templates in a dump file of the English Wikipedia, and then I've found your My question is: how does it work? Would it be possible to use it to transclude/substitute templates in a (offline) dump of the English Wikipedia? Regards, Morail 15:18, 17 November 2010 (UTC)


You entered this as a Korean abbreviation of after service. I've changed it to English; I'm assuming you mean English as Korean doesn't have the words after and service. --Mglovesfun (talk) 13:17, 19 May 2011 (UTC)


Hi there. Any chance User:Visviva/Elsewhere can be remade? Most of the links are blue now. Thanks. --Simplus2 09:54, 7 December 2011 (UTC)

Request for deletion of Middle Korean entries[edit]

I have started a request for deletion of Middle Korean because the source is not clear. I hope you will consider joining the discussion at Wiktionary:Requests_for_deletion#Deletion_of_Middle.2FOld_Korean.2C_Silla.2C_Goguryeo_and_Baekje. --BB12 (talk) 04:17, 25 November 2012 (UTC)

Changing {{SAMPA}} to {{X-SAMPA}}[edit]

Hi. I’m getting rid of this redirect everywhere. I found an instance of this template used in your page User:Visviva/ipa sampa subst, and thought twice about it. Would you mind updating it? Thanks. Michael Z. 2013-02-17 01:55 z

I see you’re not around, so I’ve gone ahead and made the change.[6] Regards. Michael Z. 2013-02-22 22:07 z

Q about 우리 and [edit]

Hello Visviva --

I've been slowly figuring out KO morphemes and roots. I noticed that you added a note to when you substantially expanded the entry in 2010, describing the "we, us" sense's etymology as a contraction of 우리.

Might it be the opposite, in fact? I mean, where 우리 might be derived as + , much as some other modern pronouns appear to be formed by fusion with this particle, like and .

I'd appreciate any additional light you could shed on the relationship between 우리 and . Cheers, ‑‑ Eiríkr Útlendi │ Tala við mig 18:06, 11 February 2014 (UTC)

mojibake and 외계어[edit]


Isn't 외계어 (外界語) also a valid translation for mojibake and a valid, attestable term, even if it's jocular? Russian also uses a jocular term for mojibake. --Anatoli (обсудить/вклад) 23:42, 11 February 2014 (UTC)

Welcome back[edit]

We missed you. DCDuring TALK 14:25, 30 May 2015 (UTC)

Thanks! I seem to actually be back this time. -- Visviva (talk) 07:40, 26 June 2015 (UTC)


Do you still have the code for this? I'd be interested in getting more lists. DTLHS (talk) 05:17, 26 June 2015 (UTC)

I definitely have some pieces of the code; e.g. here. May take a while for me to remember how everything fit together. Some sources (like the NYT) may now be out of reach. Will advise. -- Visviva (talk) 07:39, 26 June 2015 (UTC)
Dear lord, what an ungodly mess of spaghetti code. I want to go back in time and smack some sense into myself. But I have updated it & done some runs on recent NYT and Guardian issues, culling manually (don't have a backup of my old stopword/misspellings list). The pickings do seem to be slimmer than they used to be, although I wonder if I'm maybe just not getting all the text. Next step, set it up to run automatically so that it isn't quite so dependent on my presence of mind. -- Visviva (talk) 05:26, 27 June 2015 (UTC)
Thanks! The smaller lists are encouraging, since we seem to be actually making progress (thanks @Equinox:) DTLHS (talk) 05:28, 27 June 2015 (UTC)
If you feel up to it, maybe could be added (they're very strict about robots scraping their site though). DTLHS (talk) 03:31, 30 June 2015 (UTC)
Probably out of reach; I think I'd be OK as far as their bulk access policy, but PDFs are in general way more struggle than they're worth. (Even once you wrangle the text out of them, you tend to end up with a problematically large number of non-words and non-sentences.) The LaTEX source files might be more viable, but that would be a significant learning curve. I had however been thinking of dabbling in PubMed Central a bit, if there's anything there that might be of interest. -- Visviva (talk) 05:29, 30 June 2015 (UTC)
That's true, although just the HTML abstracts seem to provide plenty of new words. DTLHS (talk) 20:45, 1 July 2015 (UTC)
Happy hunting! -- Visviva (talk) 05:16, 2 July 2015 (UTC)
One other thing, how hard would it be to replace the $ tags with <math> and </math>? DTLHS (talk) 04:32, 7 July 2015 (UTC)
Harder than I thought. That is, the replacing is easy, but it breaks the links horribly:
2015, J. J. Nuño-Ballesteros, B. Oréfice-Okamoto, J. N. Tomazella, “Equisingularity of map germs from a surfasse to the plane”[7], arXiv: 
As an application, we show that a family (X_t,0) is Zariski equisingular if and only if it is Whitney equisingular and the numbers of cusps and double folds of a generic linear projection are constant on t..
add - notemp
I surmise that the reason for this is that the parser handles the <math> tags relatively early in the process (so that they're already rendered before it renders the URL). There must be a way around this (I'm thinking perhaps a template that substs as <math>), but I'll have to gnaw on this a bit. -- Visviva (talk) 06:30, 7 July 2015 (UTC)
I can't believe this actually works, but it seems to (except for rare cases where the <math> code contains its own double curly brackets, in which case a manual <nowiki> insertion is probably required).
2015, J. J. Nuño-Ballesteros, B. Oréfice-Okamoto, J. N. Tomazella, “Equisingularity of map germs from a surfasse to the plane”[8], arXiv: 
As an application, we show that a family <math>(X_t,0)</math> is Zariski equisingular if and only if it is Whitney equisingular and the numbers of cusps and double folds of a generic linear projection are constant on <math>t</math>.
add - notemp
Will set the script to make that substitution starting tonight. (The more sensible thing would probably be to recode this whole ParserFunctions house of cards in Lua, but that seems like work.) -- Visviva (talk) 17:32, 7 July 2015 (UTC)


Please don't use templates from your user space in other namespaces (other than discussion pages). —CodeCat 15:45, 1 July 2015 (UTC)

Yes, sorry about that. Think I have zapped them all now. -- Visviva (talk) 16:04, 1 July 2015 (UTC)

PMC 20150702[edit]

Should this be "BMC 20150702". SemperBlotto (talk) 10:59, 3 July 2015 (UTC)

No, the idea is to scan all fresh article postings on PubMed Central (PMC) -- it's just an odd coincidence that the articles released on this date were all from BMC. -- Visviva (talk) 16:55, 3 July 2015 (UTC)
OK. By the way - I have been deleting your Guardian subpages when all the words have been added. Is that what I am supposed to do? (I can get them back if not) SemperBlotto (talk) 19:56, 3 July 2015 (UTC)
p.s. I don't suppose you fancy harvesting articles from [9] (PLOS one).
Yes, thanks for deleting them. I hadn't quite worked up to doing that, but I don't think there's any good reason to keep them around once they've served their purpose. Will put PLoS One on my to-ponder list; have dug myself a bit of a time-hole at the moment.-- Visviva (talk) 20:11, 3 July 2015 (UTC)

Spanish Tracking[edit]

Hey. While you're doing some tracking, would it be possible to generate some for Spanish too? Ones like Robert did a long time ago are great, although now they have got mostly blue links. --A230rjfowe (talk) 19:13, 4 July 2015 (UTC)

Now rolling for El País; let me know if anything seems seriously haywire. If you use the "add" links, be aware that I haven't added any Spanish functionality to the POS-guessing function, so it basically thinks everything is a noun. -- Visviva (talk) 19:04, 5 July 2015 (UTC)
That's great. Thanks a lot for unrolling it. It would be even better if it could include any pages which have a Wiktionary entry but not in Spanish. --A230rjfowe (talk) 13:00, 6 July 2015 (UTC)
That does put things in a different light. (Asterisks mark false blue links, although I see you've already made a few of them into true ones since the last dump.) Enjoy! -- Visviva (talk) 19:09, 6 July 2015 (UTC)
I wonder if it would be useful to have those links go to a "new section" page when there is an existing entry for another language, like so. Not sure if there is currently a bot that puts sections back into order... -- Visviva (talk) 19:33, 6 July 2015 (UTC)
Yeah, that seems a good idea. Most of the "false blue links" appear to be verb forms - especially forms of the past participles which have an entry in Portuguese or Galician, and I generally shirk away from creating them, leaving them for a bot to create. --A230rjfowe (talk) 06:58, 9 July 2015 (UTC)
  • Can you stop making these pages, please. It gives me far too much work to do. Also, I start my holiday tomorrow --A230rjfowe (talk) 00:37, 2 August 2015 (UTC)
    • Sure thing. But please don't consider these lists an obligation in any event. I think they can serve useful functions over time (providing backlinks and the like) even if they aren't being blued out regularly. (At least that's what I tell myself to excuse the fact that I still have lots of 5-year-old lists sitting around that I've never cleared.) -- Visviva (talk) 01:47, 2 August 2015 (UTC)
      • Hey. If you want to roll the Spanish ones again, it'd be cool. Mostly to see how much progress we've made with them. --A230rjfowe (talk) 22:21, 25 August 2015 (UTC)
        • It really would be brilliant to roll El Pais again. I'm clearing the older Pais ones, with the help of one or two other excellent Wiktionarians, and they're nearly done. My holiday, BTW, was excellent. --Zo3rWer (talk) 14:27, 12 September 2015 (UTC)
          Glad to hear! Sorry I've been a bit out-to-lunch myself. These should start rolling again as of tonight. -- Visviva (talk) 01:07, 13 September 2015 (UTC)
          • Thanks. I see DTHLS has already started adding words from them. --Zo3rWer (talk) 12:48, 14 September 2015 (UTC)


Thanks for that. As I suspected, it will generate lots of weird mathematical terms that I won't be able to find a definition for. Cheers. SemperBlotto (talk) 07:57, 8 July 2015 (UTC) p.s. Do you know why User:Visviva/arXiv 20150707 etc, User:Visviva/PLOS 20150706 and others do not show up as subpages in your user page? SemperBlotto (talk) 10:23, 8 July 2015 (UTC)

I've noticed the "add" button tries to guess at the part of speech. For words ending in -ed, I think Adjective is usually a better guess than Verb. (Where it is in fact a verb, I modify the URL to create the lemma instead.) Equinox 21:10, 8 July 2015 (UTC)
That's a good point -- I wonder if it varies based on the source material; it seemed from working through old newspaper pages that most -ed words were verb forms, but I suspect you're right that this doesn't transfer to journals. -- Visviva (talk) 02:25, 9 July 2015 (UTC)
Also (mainly because of the sciences) an "inter-" prefix suggests adjective, not noun. Equinox 07:51, 9 July 2015 (UTC)
Having tricked myself into putting the wrong POS on an entry quite a few times, I'm wondering if perhaps the smarter thing is just to make a really ugly invalid header (like "===POS===") so that it will be immediately obvious that it needs to be fixed. -- Visviva (talk) 18:39, 9 July 2015 (UTC)
Well, I mainly like it how it is. I get really sick of typing "Adjective" and "Etymology" all the time; I had a Chrome browser plugin that let me enter them with shortcut keys, but I could never find it again after losing my settings. Equinox 22:59, 11 July 2015 (UTC)
Re the pages not showing up: I think it's because I have too many subpages (Special:Prefixindex conks out at /M, so it doesn't get to the lowercase pages at all). I've rearranged my user page a bit so that these particular ones are findable, but ultimately I probably need to go on a serious deletion spree. -- Visviva (talk) 05:59, 9 July 2015 (UTC)
https and eppendorf are useless items that occur with very high frequency in your science-mag scrapings; perhaps exclude? Equinox 20:12, 12 August 2015 (UTC)

French tracking[edit]

Hi V, thanks for the Spanish tracking again. Any chance of tracking some French newspapers too? I'd ask about Asturian papers too, but there aren't any. --A230rjfowe (talk) 13:48, 14 July 2015 (UTC)

Here's a start; let me know if it suits. I've switched this and El País to use {{m}} for the magical orangelinks. -- Visviva (talk) 20:23, 16 July 2015 (UTC)
Yeah, it suits. Lots of orange links there, which leads to finding incredibly common words like construction and compensation missing in French. Thanks. I'm sure if you do some Italian tracking too, SB will be really happy. Hey, why not do every single language in the world? --A230rjfowe (talk) 23:48, 16 July 2015 (UTC)
Of course I would. But I can't keep up with PMC and PLOS and haven't done any Italian for yonks. SemperBlotto (talk) 07:23, 17 July 2015 (UTC)

PLOS misspellings[edit]

Hi there. There seems to be quite a lot of misspellings here. I will continue to add misspelling entries for those that seem to be a little more common, but if they seem to be more or less unique I am going to ignore them if that is OK with you. (e.g. "apecies" for "apices") SemperBlotto (talk) 07:40, 16 July 2015 (UTC)

Fine by me; they don't really seem worth spending time one for the most part. I've been fiddling with a rudimentary typo filter, but an awful lot of them are still getting through. (At the moment I have the filter set to assume that anything that appears more than once is not a typo, but in a lot of these cases it appears that the author had entered the misspelling into spellcheck, so that it occurs several times in the document -- so I may try flipping that switch the other way and seeing what happens.) -- Visviva (talk) 20:27, 16 July 2015 (UTC)


Hi there. I noticed that this term was present in a citation included in User:Visviva/PLOS 20150710, but wasn't listed as missing. Is there a reason you are not including such things? (as it happens, I don't know how to define it!) Cheers SemperBlotto (talk) 16:12, 19 July 2015 (UTC)

As currently set up, the script filters out anything that's italicized; there's also a less-uniformly-successful filter that tries to zap anything that looks like a genus-species binomial even if not italicized. These names aren't, after all, exactly English -- but perhaps a separate sublist would be a good idea? -- Visviva (talk) 17:55, 19 July 2015 (UTC)
No, that's OK. I have my own scheme for finding such things (but I haven't got the time to do it any more!). SemperBlotto (talk) 19:47, 19 July 2015 (UTC)

Questions about tracking[edit]

What are your requirements with respect to a source for tracking? I was thinking that perhaps the abstracts of scientific journals in fields that I should know well enough to create entries for any missing terms (like geology and palaeontology) would be more productive for me, if you'd be willing to generate them, but I don't know whether there are any other limiting factors. Thanks! —Μετάknowledgediscuss/deeds 01:45, 2 August 2015 (UTC)

The big limiting factor is having some public source that can be scraped automatically at regular intervals. PubMed and PLOS ONE are very handy that way, since there's a daily fount of fresh full-text material (except when PubMed's API is having the hiccups). But if we're just going after abstracts, a lot of things that are difficult/impossible to access in full text become much more doable. (Though by the same token, the terminological pickings can be relatively slim; I played around with doing a daily scrape of SSRN abstracts but got < 5 new words per day, which didn't seem worth it.) I'll try to take a crack at this sometime soon, see how it goes. Are there any particular journals you're especially interested in? -- Visviva (talk) 02:03, 2 August 2015 (UTC)
Thanks. Journals like Geology, Paleobiology, Journal of Micropalaeontology, Journal of Paleontology, and Journal of Sedimentary Research are good stuff. How easy is it to scrape things? I can copy-paste large, promising chunks of text into files if you can work your magic on them, because my current method is just to wikify (well, {{l}}ify) text and then do it mostly manually, which is pretty ineffective. —Μετάknowledgediscuss/deeds 03:47, 2 August 2015 (UTC)

What exactly do the edit-summary messages mean (e.g. "gardening", "sowing")? Equinox 07:57, 29 August 2015 (UTC)

Just a gesture toward the concept of the bot as gardener; also they give me a quick way to find which part of the code is responsible for a particular page/edit. -- Visviva (talk) 01:20, 4 September 2015 (UTC)
  • Note: I think I'm going to switch these lists off for the moment, as I can't seem to find the requisite time to do even the most basic maintenance/supervision. -- Visviva (talk) 01:20, 4 September 2015 (UTC)
  • They actually seem to be working quite well without any maintenance! Equinox 01:22, 4 September 2015 (UTC)
  • If you want to have a break from generating these lists, that's fine by me. We already have masses to process - and it takes so much time! SemperBlotto (talk) 10:57, 5 September 2015 (UTC)
    In view of somewhat conflicting signals from different quarters, I haven't hit the general kill switch. I've switched off PMC, since that was going to be a personal project for me and I haven't been around to work on it. Should I nuke PLOS also? -- Visviva (talk) 00:56, 13 September 2015 (UTC)
    I still process PLOS - though I have taken a short break to work on French lately. I will be back to work on PLOS soonish. SemperBlotto (talk) 07:31, 20 October 2015 (UTC)
  • And what ought one to do with pages where everything has been created or sequestered, like User:Visviva/NYT 20090317? Delete them? —Μετάknowledgediscuss/deeds 20:27, 14 September 2015 (UTC)

Pais links[edit]

For the page on September 17th and the sentence "La cerrazón húngara ha propinado un fuerte varapalo al éxodo de migrantes.", I noticed the link was to this article where the sentence doesn't appear. This appears to be the actual article. Do you know what's going on here? DTLHS (talk) 18:31, 17 September 2015 (UTC)

Dashboard entries[edit]

Hi there. I started to delete these (they don't seem to do anything) then realised that I ought to ask you. Do you still need them? SemperBlotto (talk) 05:36, 19 October 2015 (UTC)


I think this is Catalan (if the link starts with ccaa it's from the Catalan portion of the site - maybe you could segregate these). --Zo3rWer (talk) 14:48, 1 November 2015 (UTC)