Module talk:languages/data/exceptional

From Wiktionary, the free dictionary
Jump to navigation Jump to search

Addition[edit]

Please add this for cel-pro:

	sort_key = {
		from = {"[ɸφ]"},
		to   = {"f"}},

CodeCat 14:12, 18 October 2016 (UTC)[reply]

@CodeCat: Done. —JohnC5 14:50, 18 October 2016 (UTC)[reply]

False statement[edit]

In

m["sai-chn"] = {
	"Chaná",
	"Q5072718",
	"sai-crn",
	otherNames = {"Chana", "Sari"},
	scripts = Latn,
}

there is a mistake or false statement.

Chaná language is not known by a alternative name "Sari". None reliable sources have even mentioned this supposed name. English Wikipedia it's the cause of the widespread of this mistake. The only "source" which assign this term to chaná language is linguist list, but they wouldn't even know anything about this language neither direct sources about it, and I can say that because they've mistaken it as it would be an african language, when it is not. You can find some sources on chaná language here: w:es:Discusión:Lenguas_charrúas#Sobre_el_chaná. --Zerabat (talk) 22:40, 1 March 2018 (UTC)[reply]

I fail to see how an alternative name that does not actually see use could be a problem (and it does account for LinguistList's confusion). We could remove it, though. —Μετάknowledgediscuss/deeds 02:49, 2 March 2018 (UTC)[reply]
I've removed it. Since there apparently are African languages called Sari, it's probably confusing to leave it in the list of alt names. I would've suggested leaving it as a -- commented-out note about Linguist List's use of it for any future users searching this site for "Sari" in connection with this language, but this discussion accomplishes that just as well. - -sche (discuss) 03:00, 2 March 2018 (UTC)[reply]

Addition of Idiom Neutral[edit]

Please add the following constructed language. Some Ido words (such as partiso) are borrowed from Idiom Neutral, and it would be good if we could indicate this properly.

m["art-idn"] = {
	"Idiom Neutral",
	"Q35847",
	"art",
	type = "appendix-constructed",
	scripts = Latn,
}

Robin van der Vliet (talk) (contribs) 12:03, 19 April 2018 (UTC)[reply]

If we only need to cite it in etymologies, I think it'll be sufficient that I now added it as an etymology-only language under the code "und-idn". If you wanted to include entries in the language, even in the appendix space, I think that would need more input/approval, since we've tended to be much less inclusive of little-used / ISO-uncoded constructed languages than of natural languages. - -sche (discuss) 04:39, 24 April 2018 (UTC)[reply]
-sche is correct on all counts. You are welcome to create a discussion on adding it to the Appendix at the Beer parlour. As for the code, we've used art- for constructed languages in the past — is that something we should be avoiding, or is it listed in some standard? —Μετάknowledgediscuss/deeds 05:34, 24 April 2018 (UTC)[reply]

Proto-Totonacan[edit]

Please add:

m["nai-ttn-pro"] = {
	"Proto-Totonacan",
	"Q63436996",
	"nai-ttn",
    otherNames = {"Proto-Totonac-Tepehua"},
	type = "reconstructed",
	scripts = {Latinx},
}

See Proto-Totonacan for some sources. Thanks, Julia 01:58, 3 May 2019 (UTC)[reply]

[cel-gau] Gaulish language script amendment needed[edit]

The corpus of the Gaulish language uses 3 different scripts: Latin, Greek and Etruscan. See this wikipedia page for more details.

Please, in "cel-gau" someone add "Ital" to the list of script in addition to "Latn" and "Grek". The line should read:

scripts = {"Latn", "Grek", "Ital"},

— This unsigned comment was added by Edo999 (talkcontribs) at 09:51, 28 July 2019 (UTC).[reply]

@Edo999: I see that the article you link to (w:Gaulish language § Inscriptions) mentions that the Etruscan alphabet is used by Lepontic, but Wiktionary distinguishes that from Gaulish, and the Lepontic language already has Italic in its list of scripts. And currently no Gaulish links use Italic script (search for "cel-gau" in User:Erutuon/scripts in link templates). I don't see Italic-script entries listed in Category:Gaulish lemmas either. So it's not clear to me that Gaulish needs Italic in its list of scripts. Does non-Lepontic Gaulish use Italic script? — Eru·tuon 14:35, 28 July 2019 (UTC)[reply]
@Erutuon: Thank you for your comment. Yes, Cisalpine Gaulish uses Etruscan script. The page "Liste des inscriptions en gaulois" on French Wikipedia has Gaulish samples in the three different scripts (Etruscan, Greek and Latin). So do the literature about Gaulish language (I'm currently reading La langue gauloise by Pierre-Yves Lambert that has one chapter for inscriptions in each of the 3 scripts). Considering Lepontic as an early form of Gaulish or as a distinct Continental Celtic language is still a matter of debate. Either way, Gaulish (in the former hypothesis) and Cisalpine Gaulish (in the latter one) both have inscriptions in Etruscan. I hope that makes sense, let me know if not! — This unsigned comment was added by 185.45.4.4 (talk) at 16:03, 2 August 2019 (UTC).[reply]
@185.45.4.4: Thanks, I've added the script on the basis of what you've said. — Eru·tuon 19:50, 2 August 2019 (UTC)[reply]

Transliteration for Early Assamese[edit]

@AryamanA Hello. Earlier Early Assamese used Module:inc-mas-translit (for Middle Assamese) but due to phonological differences, I want to separate them. I already created Module:inc-oas-translit. Can you replace it? Msasag (talk) 06:33, 26 October 2020 (UTC)[reply]

@Msasag: Sure! —AryamanA (मुझसे बात करेंयोगदान) 15:15, 26 October 2020 (UTC)[reply]

Pamunkey Language[edit]

add this

m["nai-pky"] = {
	"Pamunkey,
	"Q106415443",
  	scripts = Latn,
}

--101.98.133.254 04:59, 8 April 2021 (UTC)[reply]

Old Catalan as a descendant of Old Occitan[edit]

No scholarly source supports such a notion; the two languages were contemporaries. For more commentary, see Wiktionary:Beer parlour/2021/December#Catalan and its supposed descent from Old Occitan. Please change the ancestor of Old Catalan from "pro" (Old Occitan) to "la" (Latin). Nicodene (talk) 05:40, 26 December 2021 (UTC)[reply]

alg-pro (Proto-Algonquian) normalization request[edit]

Could someone automatically change "ᐧ" in alg-pro links to "·"? Some notable sources (e.g., Proto-Algonquian Online Dictionary) use this other symbol. I just noticed this after copying and pasting reconstructed forms to several entries. It is easy to overlook. 70.172.194.25 21:38, 25 February 2022 (UTC)[reply]

Fix names of apabhramshas and prakrits[edit]

It seems like many of the names for the apabhramshas and prakrits are transliterated incorrectly. For example, all the apabhramsha names read as "Apabhramsa", which is incorrect. Similarly, names like Shauraseni read as "Sauraseni", which is also incorrect. All of these need to be fixed. Getsnoopy (talk) 18:58, 11 June 2022 (UTC)[reply]

Regarding Middle Tamil[edit]

Hello, there is a lang-code called 'ta-mid' for Middle Tamil which has to be removed since it does not meet the required format specification for creating a lang-code, instead a new lang-code has to be created for Middle Tamil (dra-mta) like there is one for Middle Kannada (dra-mkn). 'dra-mta' has be made the ancestor of 'ta' (Tamil). And 'oty' (Old Tamil) has to be made the ancestor of 'dra-mta (Middle Tamil).' Emmanuel Asbon (talk) 15:54, 21 August 2022 (UTC)[reply]

Regarding Proto-Yoruboid[edit]

Can Volta-Niger (alv-von) be added to the Proto Yoruboid (alv-yor-pro) ancestor language tree to reflect that it is a direct ancestor of Proto-Yoruboid before reaching Volta-Congo, which is the ancestor to Volta-Niger. Oniwe (talk) 16:23, 3 September 2022 (UTC)[reply]

Belter Creole[edit]

'art-blt' is already in use for a few category names for Belter Creole, so it probably needs to be added to this module.

m["art-blt"] = {
	"Belter Creole",
	108055510,
	"art",
	Latn,
	type = "appendix-constructed",
}

SweetPotatoGolem (talk) 22:37, 26 December 2022 (UTC)[reply]

Requested edit[edit]

Change "Module:inc-mbn-translit" to "Module:bn-translit". Sbb1413 (he) (talkcontribs) 09:28, 14 March 2023 (UTC)[reply]

@Sbb1413 Are there any functional differences between the two, or is Module:inc-mbn-translit simply pointless duplication? Theknightwho (talk) 10:19, 14 March 2023 (UTC)[reply]
Module:inc-mbn-translit is redundant as Module:bn-translit will work just well and the inherent vowels are also rendered quite well with some exceptions. Sbb1413 (he) (talkcontribs) 10:22, 14 March 2023 (UTC)[reply]
@Sbb1413 Would it be worth making Module:inc-mbn-translit run Module:bn-translit as its first task? So that then all that needs to be specified are the things that are different. I did something similar with Module:Deva-as-Beng-translit, which effectively just modifies the result from Module:Deva-Beng-translit. Theknightwho (talk) 11:03, 14 March 2023 (UTC)[reply]
@Theknightwho I have done this at Module:inc-mbn-translit/sandbox but shows no output. Sbb1413 (he) (talkcontribs) 13:18, 26 April 2023 (UTC)[reply]

Requested edit 4 April 2023[edit]

Add the following: m["inc-obn"] = { "Old Bengali", nil, "inc-eas", "Beng", ancestors = "inc-mgd", translit = "sa-translit", } Sbb1413 (he) (talkcontribs) 09:41, 4 April 2023 (UTC)[reply]

Proto-Slavic ś should be an exception for removing accents[edit]

Please append smth like , remove_exceptions = {"Ś", "ś"} to the entry_name conversion in the Proto-Slavic entry:

m["sla-pro"] = {
	"Proto-Slavic",
	747537,
	"sla",
	"Latinx",
	type = "reconstructed",
	entry_name = {remove_diacritics = c.grave .. c.acute .. c.tilde .. c.macron .. c.dgrave .. c.invbreve},
	sort_key = {
		from = {"č", "ď", "ě", "ę", "ь", "ľ", "ň", "ǫ", "ř", "š", "ś", "ť", "ъ", "ž"},
		to = {"c²", "d²", "e²", "e³", "i²", "l²", "nj", "o²", "r²", "s²", "s³", "t²", "u²", "z²"},
	}
}

Otherwise, this reconstructed phoneme is not distinguished from s, which leads to one of the links at вьсь being incorrect (it should be to vьśь). Imz (talk) 02:03, 22 April 2023 (UTC)[reply]

 Done two days after this request, although I notice it only now. Catonif (talk) 20:22, 19 September 2023 (UTC)[reply]

Cushitic proto-languages[edit]

Please add these to the module:

m["cus-cen-pro"] = {
	"Proto-Agaw",
	nil,
	"cus-cen",
	"Latinx",
	type = "reconstructed",
}

m["cus-eas-pro"] = {
	"Proto-East Cushitic",
	nil,
	"cus-eas",
	"Latinx",
	type = "reconstructed",
}

See Appendix:Proto-Agaw reconstructions and Appendix:Proto-Cushitic_reconstructions#Proto-East_Cushitic. --Ivadon (talk) 20:05, 2 May 2023 (UTC)[reply]

@Ivadon: From my experience, any reconstructions other than Proto-Highland East Cushitic are very tentative and often are not reconstructed well. This makes these codes pretty useless, because we won't create entries for these languages. Even Proto-Cushitic's creation was pretty controversial and iirc there was a bot job to remove any linked Proto-Cushitic term from the mainspace (although maybe it was just discussed, and I'm mixing things up). Thadh (talk) 20:24, 2 May 2023 (UTC)[reply]
How about Heine's Proto-Sam (Somaloid) and Kießling and Mous' Proto-West Rift? They look fine to me. --Ivadon (talk) 22:47, 2 May 2023 (UTC)[reply]
@Ivadon: Those seem fine. Do you want me to add them? Thadh (talk) 12:13, 4 May 2023 (UTC)[reply]
@Thadh: Please add them as Proto-Somaloid and Proto-South Cushitic. I will start working on the latter. Ivadon (talk) 20:31, 4 May 2023 (UTC)[reply]
@Ivadon:  Done Please chech that I haven't missed any Somaloid language. Also, we don't seem to have a code for Girirra yet, do you think we are in need of one? Thadh (talk) 21:14, 4 May 2023 (UTC)[reply]
@Thadh: Thank you. Tunni (tqq) is still missing. Girirra was recently adopted, see [1]. Both seem to be distinct from Somali, so yes. Ivadon (talk) 18:26, 5 May 2023 (UTC)[reply]
@Ivadon:  Done. Ping me again if you run into any more issues :) Thadh (talk) 23:34, 5 May 2023 (UTC)[reply]