This Wikipedia page is currently inactive and is retained for historical reference.
Either the page is no longer relevant or consensus has become unclear. If you want to revive discussion regarding the subject, you should seek broader input via a forum such as the proposals page of the village pump.
The Persian language has been written with a number of different scripts, including Avestan, Old-Persian, Partho-Sasanian Pahlavi (Middle-Persian) and Aramaic. After the Islamic conquest of the Persian Sassanian Empire in 651 AD, Arabic replaced Middle-Persian the language of government, culture and especially religion for the next two centuries.
Modern Persian appeared during the 9th century. It is written in a modified version of the Arabic script and contains large number of Arabic lexical. There are also two methods of writing Persian with the Latin alphabet.
Under Mongolian and Turkish rulers, Persian was adopted as the language of government in Turkey, central Asia and India, where it was used for centuries, and until after 1900 in Kashmir.
Persian is a member of the Iranian branch of Indo-European languages. There are a number of closely-related varieties of Persian, including:
Farsi (فارسی), is spoken by about 40 million people, mainly in central and south central Iran. There are a further 2.5 million speakers in many other countries including Bahrain, Iraq, Oman, Yemen, the UAE and the USA.
Dari (دَرِي) is spoken by more than 7 million people in Afghanistan and Pakistan.
Tajik (Тоҷикӣ / تاجيكي) is spoken by in Tajikistan, Kazakhstan, Kyrgyzstan, Russia, Turkmenistan, Ukraine and Uzbekistan by about 4.4 million people.
Standard transliteration
Printability - use of the unicode template
Note that several letters proposed in the strict transliteration system below are non-printable in several hardware/software/settings combinations, e.g. ḥ, ṣ, ḍ, ṭ, ṛ, ẓ and ṁ. These letters can be made visible on most systems by enclosing them in the {{unicode}} template, like this: {{unicode|ḥ, ṣ, ḍ, ṭ, ṛ, ẓ and ṁ}}, which results in: ḥ, ṣ, ḍ, ṭ, ṛ, ẓ and ṁ. This template should be used for most expressions using strict transliteration.
Similarly ʾ, ʿ, ᾿ and ῾ can only be used when "unicodified": {{unicode|ʾ, ʿ, ᾿ and ῾}} → ʾ, ʿ, ᾿ and ῾
The {{PerTranslit}} template includes this "unicodifying" of characters that have this printability issue.
All Persian articles should have a lead paragraph which includes the article title, along with the original Persian script and the strict transliteration in parenthesis, preferably in the lead sentence. The article title, the Persian script, and the transliteration should all be in boldface.
Some cases will require variations on this format. If the name is extremely long, the first appearance of the name is suitable to provide the strict transliteration. Likewise, if a strict transliteration appears overly repetitious, it should be in place of the page title in the lead paragraph.
Example:
Abū al-‘Abbās ‘Abd Allāh ibn Muḥammad as-Saffāḥ (Arabic: أبو العباس عبد الله بن محمد السفاح) (721 - 754) was the first Abbasid caliph. Abu al-`Abbas was the head of...
Redirects
All common transliterations should redirect to the article. There will often be many redirects, but this is intentional and does not represent a problem.
Alphabetization
Alphabetize by family name in modern cases where there is one, otherwise by the first component in the commonly used name
For alphabetization, the definite article "al-" and its variants (ash-, ad-, etc.) should be ignored, unless the primary transliteration makes the prefix a part of the name (such as Mohamed ElBaradei).
Example: Al-Qaeda should be alphabetized as "Qaeda".
For alphabetization, the family name designators "bin", "ibn", and "bint" should be ignored, unless the primary transliteration makes it a part of the name (as in the Saudi Binladin Group).
For alphabetization, the apostrophe (representing hamza) should be ignored, and letters with diacriticals should be alphabetized as if they did not have their diacriticals.
Example: Ibn Sa'ūd should be alphabetized as "Saud".
^ "In initial position, whether at the beginning of a word, following a prefixed preposition or conjunction, or following the definite article, hamza is not represented in romanization. When medial or final, hamza is romanized."
Vowels
Short vowels
Persian
Standard translit.
Strict translit.
Examples
اَ
a
a
in استراباد, Astarābād
اِ
e
e also ə
in اصفهان, Esfahān / Әsfahān
اُ
o
o
like خورشید - Xoršīd
ی
i
ī
like این - In
Long vowels
Persian
Standard translit.
Strict translit.
Examples
آ
a
ā
in استراباد, Astarābād
اُ
i
ō
like امید, ōmid
و
u
ū
like کوروش Kūroš
ی
i
ī
like ایران, Īrān
Ending of -eh
After the Arab invasion of Iran, the Middle-Persian ending of -i was replaced with ه (-eh), which sounds like -e, such as 'شاهنانه' (Shahnameh → Šāhnāmé), ژاله (Dzaleh or Jaleh → Žālé) or خانه, (Khaneh → Xāné) the modern Persian transliteration (Including Encyclopaedia Iranica) uses -é for -eh ending.
However, in Dari (the variety of Persian spoken in Afghanistan), the ending -ه- is pronounced and transliterated as -ah: Shahnamah → Šāhnāma, Zhalah → Žāla, Khanah → Khāna, etc. The usage of -ah is also supported in ALA-LC standards for the Persian language.
Names
The standard transliteration of Persian names im middle-Ages was used a single "ibn" or "bint" father's name when known and appropriate, and a family name at the end. Note that these terms are abandoned in modern-Persian.
Example: "Abū ‘Alī al-Husayn ibn ‘Abd Allāh ibn Sīnā "
Counter-example (Modern Persian): "Abu Ali Sina", "Ebn-e Sina" or "Pur Sina".
Example: "Abu Hāmed Mohammad al-Ghazālī"
Counter-example (Modern Persian): "Emām Ghazali".
When the Arabic script was adopted for the Persian language, there were letters pronounced in Persian which did not have a representation in the Arabic alphabet, and vice versa. The Persian alphabet adds letters to the Arabic alphabet, and changes the pronunciation of some Arabic letters which are not pronounced in Persian. In addition, Persian does not use a definite article ("al-"). Some of the vowels, long or short, remain transliterated the same as in Arabic.