Wiktionary talk:Frequency lists/Esperanto/Wikipedia 2011 11K

From Wiktionary, the free dictionary
Jump to navigation Jump to search

So, I attempted to wikify the final list sections, however, t got flagged as Vandalism. I am willing to finish, if it will stop calling me a vandal.

Also, it should be noted that due to previous edits, the word counts are incorrect. The first few hundred have a hundred each, but around the 400 or 500 mark, 25 words are missing. This trend continues throughout the page. Fixing that would have been my next task.

BillDStrong (talk) 18:32, 24 March 2015 (UTC)[reply]

What are you doing and what warning are you getting? Renard Migrant (talk) 18:50, 24 March 2015 (UTC)[reply]

Specifically, I am taking the lists and adding both an asterisk then enclosing the word in double brackets. I cannot seem to do this on a large collection, say a full list of a thousand words, which I understand is meant to protect against vandals. However, making this edit multiple times with less than a hundred words also triggers the block. I have tried going away and coming back, but it is rather frustrating.

BillDStrong (talk) 21:24, 24 March 2015 (UTC)[reply]

Alright, finished wikifying the lists. The issue that ran into were two bogus words consisting of nothing but repeating d's and e's respectively, that were triggering the vandalism bots.

BillDStrong (talk) 06:17, 25 March 2015 (UTC)[reply]

Update possible?[edit]

This statistic is quite old and the Esperanto Wikipedia has surely grown a lot since then. If I create a new version based on a recent dump, am I allowed to update the page? Krissie (talk) 09:28, 30 January 2023 (UTC)[reply]

@Krissie If you're willing to create another, that would be fantastic! I'm currently in the process of restructuring the whole way we organise the collection at Wiktionary:Frequency lists and would suggest creating the new one at /Esperanto/Wikipedia 2023, if you're not opposed. P.s. rather than generating a new one, you might be able to find a suitable one here. The list will need some cleaning but other than that they're pretty good. Helrasincke (talk) 05:53, 11 February 2023 (UTC)[reply]
@Helrasincke: Sure, I'm indeed working on a new list and will add it under the link you suggest. I'm currently in the clean-up stage – filtering out material in other languages and person names, preferably without also discarding valid Esperanto words. Which is tricky because of Esperanto's highly flexible word-building system. But it shouldn't take too much longer now.
The Uni Leipzig corpus is interesting, but they only use a random subset of the Vikipedio (300,000 sentences), while I work with the complete Vikipedio (more than 300 mio. characters). Krissie (talk) 19:08, 15 February 2023 (UTC)[reply]
Still working on the data cleanup, but I think my new list will be ready in April. Krissie (talk) 11:46, 17 March 2023 (UTC)[reply]
I've uploaded the new list: Wiktionary:Frequency lists/Esperanto/Wikipedia 2023 Krissie (talk) 15:19, 27 March 2023 (UTC)[reply]