User talk:Angr

Definition from Wiktionary, the free dictionary
Jump to: navigation, search

Diacritics[edit]

Hi, was reading the page on Ancient Greek in Wiktionary and saw your name at the bottom. Maybe you can help me or point me to the right contact. I am not sure how to deal with some grc entries in Wiktionary. There are terms that differ only by a diacritic. See for example σέλῑνον (sélīnon, celery) in parsley and σέλινον (sélinon) in sedano. As I am extracting etymological data from Wiktionary, I would like to collapse those two entries into one entry only. Do you think I can use some kind of general rule, e.g., replace ῑ with ι? Or should I just keep both to represent data in Wiktionary? The link to the tool I am developing is etytree Epantaleo (talk) 15:49, 13 October 2017 (UTC)

As you might have noticed, both links go to the same entry; the macron is just extra information on vowel length that is usually not represented outside of headwords. You can find all the diacritical marks that are stripped for each language by looking at the subpages of Module:languages. —Μετάknowledgediscuss/deeds 15:52, 13 October 2017 (UTC)
(edit conflict) Ancient Greek entries never include the long and short marks, though they may be displayed. Thus the two links you provided above both point to the same page name, σέλινον. The long and short marks are always optional; if you aren't sure, you can leave them out. Other Ancient Greek diacritics, however, are not optional and are part of the page name: the acute and circumflex accents, the rough and smooth breathings, and the diaresis are all required parts of the page name. —Aɴɢʀ (talk) 15:54, 13 October 2017 (UTC)