User:Sack36/archive-intro

Definition from Wiktionary, the free dictionary
Jump to: navigation, search
Wiktionary:Babel
en This user is a native speaker of English.
Search user languages or scripts

This user is currently working on
Wikisaurus.

My name is Susan Amina Sackinger. You can call me Susan or Amina, either is OK. I have been working on the Wikipedia pages for about 2 years. When I finally heard you were starting up work on a thesaurus project I scrambled over here.

Experience[edit]

I have 35 years of computer experience in all manner of positions. (I was mostly dealing with technophobes in the non-profit arena.) I also am a freelance journalist for some of the gadget news sites and a writer, award winning poet and published editor. (I've also been a semi-pro singer, an artist in acrylics with painting hung in organizations and homes across the US, an artist in oils mainly on the east coast, and a clay and a fiber artist but none of them have much to do with wikisaurus.) It's amazing what piles up over 50 years!

Area of Interest[edit]

Most of my growing years were spent with a book in hand. This made my speaking vocabulary the same level as most people's reading vocabulary and my reading vocabulary at about 4th year college. (with some notable oddities) My interests lie in the non-technical side of the action. (yes, I know, that makes it one of the notable oddities.)

Philosophy of Thesaurus[edit]

I feel Roget's Thesaurus was absolutely stupendous for what it was--a paper classification system for words that mimicked the classification system for plants and animals. With the coming of the computer industry, though, it has become obsolete. I've been reading all about "head words" and "near synonyms" and other such ethereal concepts and I'm thinking that most of the people discussing this are too close to the problem. They no doubt have English degrees, Literature awards and other such ephemera.

Lord knows that's the kind of people who can really do a good job making a dictionary which goes into great depth and encompass so much. I just don't think a Literature major can understand where the average Joe Schmutz is coming from. I have worked in the non-profit area for 25 years. During that time I have been privy to teach many people both illiterate and those of average literacy (6th grade, aprox). None of those I taught would have been able to give you the meaning of synonym let alone the other 13 categories you plan on using. Unless we plan on this being so esoteric as to be useless to any but the top 5% of the literati, we must come up with a simpler plan.

Electronic Thesaurus[edit]

We can visualize the function of a thesaurus by thinking of it as a web that links every definition to every other definition through shades of meaning. Using Head-words removes some of the linkages in this fine web. When you do that, it leaves holes that meaning can fall through. I challenge anyone to find one word in the English language that means exactly the same as another. If the nuance is not in their lexicon, it doesn't mean it isn't there. Ours is a huge and complex language. We shouldn't attempt to simplify it by removing bits of it.

On the other hand, why make work for ourselves by adding extra wordage to each page or extra pages which will be difficult to get to. Such is the case with the main page method of wikisaurus. In order to cover all the areas you leave out, twice as many words must be defined. In order to have a main and sub-categories, you end up with an inability to actually get the users to the category. Just as wiktionary doesn't require wikisaurus assistance until the user needs synonyms, so wikisaurus shouldn't require wiktionary unless an in-depth definition (or any of the other things they provide) is needed.

The other part of this equation is the number of synonyms (or by my way of seeing it near-synonyms all), antonyms and other bits and bobs we want to pack into one area. In reading over all these discussions I've heard everything from duplicate the paper template exactly (Roget's) to "Frag it! Let's dump it all into the wiktionary and have done with it!" There were a lot of steps in between (thank God!) Somewhere along the way, though, I saw that something fairly basic was being assumed that shouldn't have been.

The erroneous assumption[edit]

It's the same assumption that is made by inexperienced pilots; non-mathematicians or physicists. Most of us perceive things in a two dimensional way when we can experience three (pilots) and envision more (physicists and mathematicians.) The arguments were always about what should go on the page and why, but never about how multidimensional we should go. It was touched on when Roget's was brought up ("it's like the skin of an onion".) We're working with a computer here. There is no end to the complexity we can shove on the computer to make it easy for everyone else. ...well, ok, there is an end. We can bring a computer to it's knees if we give it too much to do. I actually experienced that at one of the companies I worked for.(back in the dark ages) The system we had took 8 hours to be booted up.

Flight of Fancy[edit]

We are nowhere near there. I want to give back to the computer some of the complexity we've taken on ourselves. I want the computer to start using those multi-dimensions to make connections we don't have to make with our brains. Let's start by giving it back all the look-ups. We don't care if it has to go through fifteen hundred linkages to get a word defined. Let it. Just so we don't have to. Let's give it three categories: Synonym, Antonym, and Colloquial/Archaic/Slang. At first everything is a Synonym or Antonym. Over the course of time, more "click-thrus" will be done on some synonyms than on others. We keep track of click-tru and when a word drops below a certain threshold, we have the bot transfer the word from synonym into Colloquial/Archaic, or from Colloquial/Archaic into synonyms. Antonyms will be a dead zone.

We could take it several dimensions further by having a bot go through and sort words by click-thru amount; or assign color values to each span of click-thrus and have the bot change the color based on those values; or do both so that we get a rainbow. This way people who don't want to deal with the complexity don't have to while those who want every nuance mapped will get their wish.

The Page[edit]

Every word should have it's own wikisaurus page. If it's in the wiktionary, it should be in the wikisaurus. Our choice is between putting every synonym down twice or having every word have a page. For any given page we do, we won't be having three or four words. In some instances we can have as many as fifty. That many linkages per page can actually slow the computer down more effectively than having fifty more pages.

Every wikisaurus page should have all the words, regardless of part of speech on that single page. Otherwise, how does a person find a wikisaurus page with the right part of speech on it? Would they have to go through as many as four different screens to find the right part of speech before looking up their word? Nobody wants to have to do that! By putting it on the same page and having a clear TOC to find the right spot, it relieves a users time and frustration level.

Summary[edit]

There you have it in one hundred pages or less! We should not have the same structure as wiktionary on our wikisaurus pages and here you can see why this alternate structure is a sane and simple solution. Amina (sack36) 08:17, 11 July 2008 (UTC)

Helpful Links[edit]

Wikisaurus[edit]

shell for 'saurus entries[edit]

user:sack36/sandbox

{{Wikisaurus-header|name}}

Noun[edit]

Sense: name

Synonyms

Antonyms

Sense: name Synonyms

Antonyms

Adjective[edit]

Sense: name

Synonyms

Antonyms

Sense: name Synonyms

Antonyms


Verb[edit]

Sense: name

Synonyms

Antonyms

Sense: name Synonyms

Antonyms


Adverb[edit]

Sense: name

Synonyms

Antonyms

Sense: name Synonyms

Antonyms