Wiktionary:Persian transliteration/Iranian
Appearance
This page focuses on Iranian Persian, for transliteration rules concerning Classical Persian or Dari Persian see Persian transliteration/Classical. For transliteration rules concerning Tajik, see Wiktionary:Tajik transliteration.
Persian transliteration
[edit]| Persian | Wiktionary | IPA | Others (dispreferred) |
|---|---|---|---|
| ا (word-initial) | a, o, e | (ʔ)æ, (ʔ)o, (ʔ)e | |
| ا (other positions[1]) | â | ɒː | ā |
| آ | â (word-initial) -â (other positions[1]) |
(ʔ)ɒː (word-initial) ʔɒː (other positions) |
ʼā |
| ب | b | b | |
| پ | p | p | |
| ت | t | t | |
| ث | s | s | th, s̱, ṯ, s̄ |
| ج | j | dʒ | dj, ǧ ,dzh |
| چ | č | tʃ | ch, c |
| ح | h | h | ḥ, ḩ |
| خ | x | x | kh, k͟h, ḫ, ḵ |
| د | d | d | |
| ذ | z | z | dh, d͟h, ẕ, ḏ |
| ر | r | r | |
| ز | z | z | |
| ژ | ž | ʒ | zh, z͟h |
| س | s | s | |
| ش | š | ʃ | sh, s͟h |
| ص | s | s | ṣ |
| ض | z | z | ḍ, ż, ẕ |
| ط | t | t | ṭ, ţ |
| ظ | z | z | ẓ, z̧ |
| ع | '
∅ (word initial[2]) |
ʔ, ː | ʻ, ʿ |
| غ | ġ | ɣ, ɢ | q, gh, g͟h, ğ |
| ف | f | f | |
| ق | ġ | ɣ, ɢ | q, gh, g͟h, ğ, ḳ |
| ک | k | k | |
| گ | g | g | |
| ل | l | l | |
| م | m | m | |
| ن | n | n | |
| و (consonant) | v | v | w |
| و (long vowel) | u | uː, oː | ô, ū, ō |
| و (diphthong[3]) | ow | ow | au, aw |
| خوا (e.g. خواندن etc.) | xâ | xɒː | xwā-, khwā- |
| خوی (e.g. خوید etc.) | xi | xiː | |
| ه (consonant) | h (may appear in final position after a vowel, e.g. ده (dah)) | h | |
| ـه (word-final vowel) | e | e, æ | eh, a, ah |
| ی (consonant) | y | j | j |
| ی (long vowel) | i | iː, eː | ê, ī, ē |
| ی (diphthong[4]) | ey | ej | ai, ay |
| یٰ (always word-final) | â | ɒː | ā, á |
- ↑ 1.0 1.1 In a compound such as سَرْاَنْجام (sar-anjâm) or سَرْآغاز (sar-âġâz) it is treated as a hyphen and a short vowel or â follows it. (Note that colloquially it is assimilated into the consonant before it e.g. saranjâm and sarâġâz)
- ^ Initial ع is not transliterated as it is indistinguishable from an equivalent alef with the same vowel. [1] – unless it is a dialect where the two are distinguished, like Nishapuri and Kulabi; e.g. 'ayb as a dialectal form of عِیْب (eyb)
- ^ Unless bearing a shadda, in which case it is treated as consonant, e.g. اوّل (avval), not *owval and مدعو (mad'ovv) not *mad'oww. (equiavvlent to aww and uww in Classical)
- ^ Unless bearing a shadda, in which case it is treated as consonant, e.g. ایّوب (ayyub), not *eyyub and حی (hayy), not *heyy; or a proper noun, so عزیر (ozayr), not *ozeyr.
Diphthongs
[edit]| Romanization | IPA | Final | Medial | Initial | Regular? |
|---|---|---|---|---|---|
| ey | /ej/ | ـِىْ | ـِیْـ | اِیْـ | Yes |
| ay | /æj/ | ـَىْ | ـَیْـ | اَیْـ | Only borrowings |
| oy | /oːj/ | ـُىْ | ـُیْـ | اُیْـ | Only borrowings |
| ây | /ɒːj/ | ـاىْ | ـایْـ | آیْـ | Yes |
| aw | /æw/ | ـَوْ | اَوْ | No | |
| âw | /ɒːw/ | ـاوْ | آوْ | Only borrowings | |
| uy | /uːj/ | ـوىْ | ـویْـ | اویْـ | Yes |
- Classical Persian aw and in most cases ay have turned into Iranian Persian ow and ey respectively.
Other symbols or combinations
[edit]- ـاً (-an) (always word-final) – an (The position of [fathatan] is after the alef, not before, as is the current practice with Arabic)
- When a word ending with آ (â), ـا (â), ء (') or other variations gets the tanvin, it is written as either ءً ('an) or ـئاً (-'an) and is transliterated as – 'an in both cases.
- اِبْتِداءً ― ebtedâ'an ― in the beginning
- اِبْتِدائاً ― ebtedâ'an ― in the beginning
- ـٌ (-on) (always word-final, only in compound terms related to grammar and math, usually not written) – on
- مُضافٌاِلَیْه ― mozâfon-elayh ― genitive noun
- ـٍ (-en) (always word-final, only in unadapted borrowings, rare) – en
- بِعِبارَةٍاُخْریٰ ― be'ebâraten-oxrâ ― in other words
- All forms of hamze, including ء, ؤ, ئ, and أ are transliterated as ' (dispreferred: ʻ, ʿ)
- The (ezâfe) vowel is transcribed differently depending on context:
- ـِ (-e) (ezâfe) (always word-final, after a consonant, mostly unmarked in regular writing) – -e
- ـیِ (--ye) (ezâfe) (after long vowels ا (â) or و (u), sometimes marked in regular writing) – -ye
- آفْریقایِ جُنوبی ― âfriġâ-ye jonubi ― South Africa
- ـیِّ (i-ye) (ezâfe) (always word-final with ی (i), mostly unmarked in regular writing) – i-ye
- جُمْهوریِ تاجیکِسْتان ― jomhuri-ye tâjikestân ― Republic of Tajikistan
- ـهٔ (-h-ye) (U+0647 U+0654), sometimes written as ـهی (-h-i) (always word-final) – e-ye. (Articles don't contain the hamze above "he", it is considered a diacritic and only used in the display using
|head=. Templates link to words without the hamze.)
- indefinite article ـی (-i):
- After a consonant: -i
- After ـه (-e):
- After long vowel آ (â):
- After long vowel ی (i):
- After vowel و (u, o):
- Examples:
- بازیای ― bâzi-i ― a game
- گلهائی ― gol-hâ-'i ― some flowers
- بویی ― bu-yi ― a scent
- اِنْشائی ― enšâ'-i ― an essay (hamzated انشاء)
- لؤلؤی ― lo'lo'-i ― a pearl
- مجتمعی ― mojtama'-i ― a building
- The adjectival and abstract noun version of ـی (-i) follow the above rules without the hyphen – Note that if ـی (-i) is used to form an abstract noun, it gets converted to ـِگی (-egi) if the word ends with an ـه (-e)
- نه (“no, not”) – na (an exception)
- ـّ (tashdid) – geminate consonant (Arabic shadda)
- ـَ (-a) (fathe/zor) – a (Arabic fatha)
- ـِ (-e) (kasre/zir) – e (in modern Iranian, to check cases where it's "i") (Arabic kasra)
- ـُ (-o) (zamme/pish) – o (in modern Iranian, to check cases where it's "u") (Arabic damma). Used after consonants to make a short "o" sound. If used before و (o) produces a diphthong "ow":
- نُوْروز (nowruz)
- ـ۟ (sukūn/sokun) - marks absence of a vowel. Rarely used in popular Persian vocalisations, especially on final consonants. It may be necessary to use consistently in strict vocalisations to avoid any misreadings, allow automation and signalling that a word IS vocalised.
- ـه (-h) (in the word-final position after consonants for a large number of words) - e (no hyphen) (note with ezâfe the preferred spelling is ـهٔ (-h-ye))
- هَفْتِه ― hafte ― week
- ـه (-h) - when used as a colloquial copula in the 3rd person singular (he/she/it is) - -e (with a hyphen)
- تِهْرون پایْتَخْتِ ایرونِه. (colloquial) ― tehrun pâytaxt-e irun-e. ― Tehran is the capital of Iran.
- ZWNJ – - (hyphen)
- The use of hyphens for etymological reasons - suffixes, compound words, etc. when no ZWNJ is used is to be discussed. E.g. currently plural form suffix ـها (-hâ) is transliterated as "-hâ" regardless if ZWNJ is present or not. (Apart from cases described above and ZWNJ, the use of hyphen is otherwise dispreferred. A space is transliterated as a space and the absence of space or ZWNJ is transliterated as nothing.)
- Below are transliteration examples to contrast the use of ZWNJ on connecting letters, space, nothing and non-connecting letters:
ZWNJ with ZWNJ with space, no ZWNJ no space, no ZWNJ non-joining کتابها کتاب ها کتابها اتوها ketâb-hâ ketâb hâ ketâbhâ otu-hâ plural of کِتاب (ketâb) plural of اُتو (otu)
- ـیـ (-y-): (V)y(V) (for Arabic borrowings, see below)
Arabic loanwords
[edit]- ـة (-h) (always word-final) – at (rare, only in unadapted borrowings from Arabic, normally adapted into Persian as ـت (at) or ـه (e))
- الـ (note the hyphen after "l" or the first of the assimilated letter is for etymological reasons):
- al- (normally):
- اَلْیُوم (rare) ― al-yowm ― today
- "l" can change to the following consonant if it's a "sun letter", and "a" can change to "o" (Arabic "u") in ezâfe, e.g.
- With ZWNJ: -(C)-(C)
- فارِغُالتَّحْصیل (fâreġo-t-tahsil)
- With space: (C)-(C)
- فارِغٌ التَّحْصیل (fâreġo t-tahsil)
- With ZWNJ: -(C)-(C)
- moon letters (Note that the alef remains silent (اَلِفِ وَصْل (alef-e vasl))):
- With ZWNJ: -l-
- بِیْنُالْمِلَلی (beyno-l-melali)
- With space: l-
- بِیْنُ الْمِلَلی (beyno l-melali)
- Most of the time the vowel of the last first word is "o" following Arabic grammar rules, but variations do exists:
- full-words: – e
- بِالْکُلّ (be-l-koll)
- sometimes: – a
- بِیْنَالْمِلَلی (beyna-l-melali)
- بِیْنَ الْمِلَلی (beyna l-melali)
- full-words: – e
- al- (normally):
- ـیّت, ـیّات, and ـیّه: iyat, iyât, and iye (for native ـیـ (-y-), see above)
- اَهَمّیَّت (ahammiyat)
- تَقْویَّت (taġviyat)
- خُصوصیّات (xosusiyât)
- نَشْریّه (našriye)