Persian alphabet


The Persian alphabet (Persian: الفبای فارسی, romanized: Alefbâye Fârsi), also known as the Perso-Arabic script, is the right-to-left alphabet used for the Persian language. It is a variation of the Arabic alphabet with four additional letters, پ چ ژ گ, in addition to the obsolete ڤ.[1]

Persian alphabet
الفبای فارسی
Alefbâye Fârsi
"Fârsi" written in the Persian alphabet in Nastaliq style
Script type: Abjad
Direction: Right-to-left script
Languages: Persian

It was the basis of many Arabic-based scripts used in Central and South Asia. It is used for the Iranian and Dari standard varieties of Persian, and it is one of two official writing systems for the Persian language, alongside the Cyrillic-based Tajik alphabet.

The script is mostly but not exclusively right-to-left; mathematical expressions, numeric dates and numbers bearing units are embedded from left to right. The script is cursive, meaning most letters in a word connect to each other; when they are typed, contemporary word processors automatically join adjacent letter forms.

History

The Persian alphabet is directly derived and developed from the Arabic alphabet. The Arabic alphabet was introduced to the Persian-speaking world after the Muslim conquest of Persia and the fall of the Sasanian Empire in the 7th century. Following this, the Arabic language became the principal language of government and religious institutions in Persia, which led to the widespread use of the Arabic script. Classical Persian literature and poetry were affected by this simultaneous use of Arabic and Persian, and a new influx of Arabic vocabulary soon entered the Persian language.[2] In the 8th century, the Tahirid dynasty and Samanid dynasty officially adopted the Arabic script for writing Persian, followed by the Saffarid dynasty in the 9th century, gradually displacing the various Pahlavi scripts previously used for the Persian language. By the 9th century, the Perso-Arabic alphabet had become the dominant form of writing in Greater Khorasan.[2][3][4]

Under the influence of various Persian empires, many languages in Central and South Asia that adopted the Arabic script use the Persian alphabet as the basis of their writing systems. Today, extended versions of the Persian alphabet are used to write a wide variety of Indo-Iranian languages, including Kurdish, Balochi, Pashto, Urdu (from Classical Hindustani), Saraiki, Punjabi, Sindhi and Kashmiri. In the past, the Persian alphabet was commonly used for Turkic languages, but today its use is limited to those spoken within Iran, such as Azerbaijani, Turkmen, Qashqai, Chaharmahali and Khalaj. The Uyghur language in western China is the most notable exception to this.

During the colonization of Central Asia, many languages in the Soviet Union, including Persian, were reformed by the government. This ultimately resulted in the Cyrillic-based alphabet used in Tajikistan today. See: Tajik alphabet § History.

Letters

Example showing the Nastaʿlīq calligraphic style's proportion rules

Below are the 32 letters of the modern Persian alphabet. Since the script is cursive, the appearance of a letter changes depending on its position: isolated, initial (joined on the left), medial (joined on both sides) and final (joined on the right) of a word.[5]

The names of the letters are mostly the ones used in Arabic, but with Persian pronunciation. The only ambiguous name is he, which is used for both ح and ه. For clarification, they are often called ḥâ-ye jimi (literally "jim-like ḥe" after jim, the name for the letter ج that uses the same base form) and hâ-ye do-češm (literally "two-eyed he", after the contextual middle letterform ـهـ), respectively.

Overview table

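| # | Name (in Persian) | Name (transliterated) | Transliteration | IPA | Unicode | Final | Medial | Initial | Isolated |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | همزه | hamze[6] | ʾ | Glottal stop [ʔ] | U+0621 | | | | ء |
| | | | | | U+0623 | ـأ | ـأ | أ | أ |
| | | | | | U+0626 | ـئ | ـئـ | ئـ | ئ |
| | | | | | U+0624 | ـؤ | ـؤ | ؤ | ؤ |
| 1 | الف | ʾalef | â | [ɒ] | U+0627 | ـا | ـا | ا | ا |
| 2 | ب | be | b | [b] | U+0628 | ـب | ـبـ | بـ | ب |
| 3 | پ | pe | p | [p] | U+067E | ـپ | ـپـ | پـ | پ |
| 4 | ت | te | t | [t] | U+062A | ـت | ـتـ | تـ | ت |
| 5 | ث | s̱e | | [s] | U+062B | ـث | ـثـ | ثـ | ث |
| 6 | جیم | jim | j | [d͡ʒ] | U+062C | ـج | ـجـ | جـ | ج |
| 7 | چ | če | č | [t͡ʃ] | U+0686 | ـچ | ـچـ | چـ | چ |
| 8 | ح | ḥe (ḥâ-ye ḥotti, ḥâ-ye jimi) | | [h] | U+062D | ـح | ـحـ | حـ | ح |
| 9 | خ | xe | x | [x] | U+062E | ـخ | ـخـ | خـ | خ |
| 10 | دال | dâl | d | [d] | U+062F | ـد | ـد | د | د |
| 11 | ذال | ẕâl | | [z] | U+0630 | ـذ | ـذ | ذ | ذ |
| 12 | ر | re | r | [r] | U+0631 | ـر | ـر | ر | ر |
| 13 | ز | ze | z | [z] | U+0632 | ـز | ـز | ز | ز |
| 14 | ژ | že | ž | [ʒ] | U+0698 | ـژ | ـژ | ژ | ژ |
| 15 | سین | sin | s | [s] | U+0633 | ـس | ـسـ | سـ | س |
| 16 | شین | šin | š | [ʃ] | U+0634 | ـش | ـشـ | شـ | ش |
| 17 | صاد | ṣâd | | [s] | U+0635 | ـص | ـصـ | صـ | ص |
| 18 | ضاد | zâd | ż | [z] | U+0636 | ـض | ـضـ | ضـ | ض |
| 19 | طا | | t | [t] | U+0637 | ـط | ـطـ | طـ | ط |
| 20 | ظا | ẓâ | | [z] | U+0638 | ـظ | ـظـ | ظـ | ظ |
| 21 | عین | ʿayn | ʿ | [ʔ], [æ]/[a] | U+0639 | ـع | ـعـ | عـ | ع |
| 22 | غین | ġayn | ġ | [ɢ], [ɣ] | U+063A | ـغ | ـغـ | غـ | غ |
| 23 | ف | fe | f | [f] | U+0641 | ـف | ـفـ | فـ | ف |
| 24 | قاف | qâf | q | [q] | U+0642 | ـق | ـقـ | قـ | ق |
| 25 | کاف | kâf | k | [k] | U+06A9 | ـک | ـکـ | کـ | ک |
| 26 | گاف | gâf | g | [ɡ] | U+06AF | ـگ | ـگـ | گـ | گ |
| 27 | لام | lâm | l | [l] | U+0644 | ـل | ـلـ | لـ | ل |
| 28 | میم | mim | m | [m] | U+0645 | ـم | ـمـ | مـ | م |
| 29 | نون | nun | n | [n] | U+0646 | ـن | ـنـ | نـ | ن |
| 30 | واو | vâv (in Farsi) | v / ū / ow / o | [uː], [ow], [v], [o] (only word-finally) | U+0648 | ـو | ـو | و | و |
| | | wâw (in Dari) | w / ū / aw / ō | [uː], [w], [aw], [oː] | | | | | |
| 31 | ه | he (hā-ye havvaz, hā-ye do-češm) | h | [h], or [e] and [a] (word-finally) | U+0647 | ـه | ـهـ | هـ | ه |
| 32 | ی | ye | y / ī / á / (Also ay / ē in Dari) | [j], [i], [ɒː] ([aj] / [eː] in Dari) | U+06CC | ـی | ـیـ | یـ | ی |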

Historically, in Early New Persian, there was a special letter for the sound /β/. This letter is no longer used, as the /β/-sound changed to /b/, e.g. archaic زڤان /zaβān/ > زبان /zæbɒːn/ 'language'.[7]

| Sound | Isolated form | Final form | Medial form | Initial form | Name |
| --- | --- | --- | --- | --- | --- |
| /β/ | ڤ | ـڤ | ـڤـ | ڤـ | βe |

Variants

The alphabet in 16 fonts: Noto Nastaliq Urdu, Scheherazade, Lateef, Noto Naskh Arabic, Markazi Text, Noto Sans Arabic, Baloo Bhaijaan, El Messiri SemiBold, Lemonada Medium, Changa Medium, Mada, Noto Kufi Arabic, Reem Kufi, Lalezar, Jomhuria, and Rakkas.

Letter construction

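Forms (i)

| Unicode | Isolated | Start | Mid | End |
| --- | --- | --- | --- | --- |
| U+0621 | ء | ء | ء | ء |
| U+0627 | ا | ا | ـا | ـا |
| U+0649 | ى | | | ـى |
| U+06BA | ں | | | ـں |
| U+066E | ٮ | ٮـ | ـٮـ | ـٮ |
| U+062D | ح | حـ | ـحـ | ـح |
| U+0633 | س | سـ | ـسـ | ـس |
| U+0635 | ص | صـ | ـصـ | ـص |
| U+0637 | ط | طـ | ـطـ | ـط |
| U+0639 | ع | عـ | ـعـ | ـع |
| U+06A1 | ڡ | ڡـ | ـڡـ | ـڡ |
| U+066F | ٯ | | | ـٯ |
| U+06A9 | ک | کـ | ـکـ | ـک |
| U+0644 | ل | لـ | ـلـ | ـل |
| U+0645 | م | مـ | ـمـ | ـم |
| U+062F | د | د | ـد | ـد |
| U+0631 | ر | ر | ـر | ـر |
| U+0648 | و | و | ـو | ـو |
| U+0647 | ه | هـ | ـهـ | ـه |

I'jam (i)

| I'jam | I'jam code point(s) | Letters | Letter code points |
| --- | --- | --- | --- |
| 1 dot below | U+FBB3 | ب ج | U+0628, U+062C |
| 1 dot above | U+FBB2 | ن خ ض ظ غ ف ذ ز | U+0646, U+062E, U+0636, U+0638, U+063A, U+0641, U+0630, U+0632 |
| 2 dots below (ii) | U+FBB5 | ی | U+06CC |
| 2 dots above | U+FBB4 | ت ق ة | U+062A, U+0642, U+0629 |
| 3 dots below | U+FBB9, U+FBB7 | پ چ | U+067E, U+0686 |
| 3 dots above | U+FBB6 | ث ش ژ | U+062B, U+0634, U+0698 |
| line above | U+203E | گ | U+06AF |
| none | | ء ا ی ں ح س ص ط ع ک ل م د ر و ه | U+0621, U+0627, U+0649, U+06BA, U+062D, U+0633, U+0635, U+0637, U+0639, U+06A9, U+0644, U+0645, U+062F, U+0631, U+0648, U+0647 |
| madda above | U+06E4, U+0653 | آ | U+0622 |
| hamza below | U+0655 | إ | U+0625 |
| hamza above | U+0674, U+0654 | أ ئ ؤ ۀ | U+0623, U+0626, U+0624, U+06C0 |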

^i. The i'jam diacritic characters are illustrative only; in most typesetting the combined characters in the middle of the table are used.

^ii. Persian has 2 dots below in the initial and middle positions only. The standard Arabic version ي يـ ـيـ ـي always has 2 dots below.

Letters that do not link to a following letter

Seven letters (و, ژ, ز, ر, ذ, د, ا) do not connect to the following letter, unlike the rest of the letters of the alphabet. The seven letters have the same form in isolated and initial position and a second form in medial and final position. For example, when the letter ا alef is at the beginning of a word such as اینجا injâ ("here"), the same form is used as in an isolated alef. In the case of امروز emruz ("today"), the letter ر re takes the final form and the letter و vâv takes the isolated form, but they are in the middle of the word, and ز also has its isolated form, but it occurs at the end of the word.
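The joining rule described above can be sketched in a few lines of Python. This is only an illustration: the function name `contextual_forms` is hypothetical, and the sketch assumes the input contains only the basic letters (no hamza, diacritics, spaces or zero-width characters). Real text shaping is done by the rendering engine and handles many more cases.

```python
# The seven letters that never connect to the following letter:
# alef, dal, zal, re, ze, zhe, vav (written as escapes to avoid bidi confusion).
NON_JOINING = set("\u0627\u062F\u0630\u0631\u0632\u0698\u0648")


def contextual_forms(word):
    """Return (letter, form) pairs, where form is one of
    'isolated', 'initial', 'medial' or 'final'."""
    forms = []
    for i, letter in enumerate(word):
        # Joined to the previous letter only if that letter connects forward.
        joined_prev = i > 0 and word[i - 1] not in NON_JOINING
        # Joined to the next letter only if this letter connects forward.
        joined_next = letter not in NON_JOINING and i < len(word) - 1
        if joined_prev and joined_next:
            form = "medial"
        elif joined_prev:
            form = "final"
        elif joined_next:
            form = "initial"
        else:
            form = "isolated"
        forms.append((letter, form))
    return forms


# emruz ("today"): alef, mim, re, vav, ze
for letter, form in contextual_forms("\u0627\u0645\u0631\u0648\u0632"):
    print(letter, form)
```

For emruz the sketch reports the re as final and both the vâv and the ze as isolated, which matches the description above.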

Diacritics

Persian script has adopted a subset of Arabic diacritics: zabar /æ/ (fatḥah in Arabic), zēr /e/ (kasrah in Arabic), and pēš /ou̯/ or /o/ (ḍammah in Arabic, pronounced zamme in Western Persian), tanwīne nasb /æn/ and šaddah (gemination). Other Arabic diacritics may be seen in Arabic loanwords in Persian.

Short vowels

Of the four Arabic diacritics, the Persian language has adopted the following three for short vowels. The last one, sukūn, which indicates the lack of a vowel, has not been adopted.

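| Short vowel (fully vocalized text) | Unicode | Name (in Persian) | Name (transliterated) | Trans. (a) | Value (b), Farsi | Value (b), Dari |
| --- | --- | --- | --- | --- | --- | --- |
| ◌َ | U+064E | زبر (فتحه) | zebar/zibar | a | /æ/ | /a/ |
| ◌ِ | U+0650 | زیر (کسره) | zer/zir | e; i | /e/ | /ɪ/; /ɛ/ |
| ◌ُ | U+064F | پیش (ضمّه) | peš/piš | o; u | /o/ | /ʊ/ |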

^a. There is no standard transliteration for Persian. The letters 'i' and 'u' are only ever used as short vowels when transliterating Dari or Tajik Persian. See Persian Phonology

^b. Diacritics differ by dialect, due to Dari having 8 distinct vowels compared to the 6 vowels of Farsi. See Persian Phonology

In Farsi, none of these short vowels may be the initial or final grapheme in an isolated word, although they may appear in the final position as an inflection, when the word is part of a noun group. In a word that starts with a vowel, the first grapheme is a silent alef which carries the short vowel, e.g. اُمید (omid, meaning "hope"). In a word that ends with a vowel, letters ع, ه and و respectively become the proxy letters for zebar, zir and piš, e.g. نو (now, meaning "new") or بسته (bast-e, meaning "package").

Tanvin (nunation)

Nunation (Persian: تنوین, tanvin) is the addition of one of three vowel diacritics to a noun or adjective to indicate that the word ends in an alveolar nasal sound without the addition of the letter nun.

| Nunation (fully vocalized text) | Unicode | Name (in Persian) | Name (transliterated) | Notes |
| --- | --- | --- | --- | --- |
| َاً، ـاً، ءً | U+064B | تنوین نَصْبْ | Tanvine nasb | |
| ٍِ | U+064D | تنوین جَرّ | Tanvine jarr | Never used in the Persian language. Taught in Islamic nations to complement Quran education. |
| ٌ | U+064C | تنوین رَفْعْ | Tanvine rafʿ | |

Tašdid

| Symbol | Unicode | Name (in Persian) | Name (transliteration) |
| --- | --- | --- | --- |
| ّ | U+0651 | تشدید | tašdid |
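Because the short vowels, tanvin and tašdid are encoded as separate combining characters, fully vocalized text can be reduced to its unvocalized spelling by dropping them. The following is a minimal sketch using Python's standard `unicodedata` module; the helper name `strip_diacritics` is hypothetical. It removes every character with a non-zero canonical combining class, so a combining hamza (U+0654) would be removed as well, while precomposed letters such as آ are left untouched.

```python
import unicodedata


def strip_diacritics(text):
    """Drop combining marks (zebar, zir, piš, tanvin, tašdid, ...)."""
    return "".join(ch for ch in text if not unicodedata.combining(ch))


# omid ("hope") written with a piš on the initial alef
vocalized = "\u0627\u064F\u0645\u06CC\u062F"
print(strip_diacritics(vocalized))
```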

Other characters

The following are not actual letters but different orthographical shapes for letters, or a ligature in the case of the lâm alef. As for ء (hamza), it has only one graphical form since it is never tied to a preceding or following letter. However, it is sometimes 'seated' on a vâv, ye or alef, and in that case, the seat behaves like an ordinary vâv, ye or alef respectively. Technically, hamza is not a letter but a diacritic.

| Name | Pronunciation | IPA | Unicode | Final | Medial | Initial | Stand-alone | Notes |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| alef madde | â | [ɒ] | U+0622 | ـآ | | آ | آ | The final form is very rare and is freely replaced with ordinary alef. |
| he ye | -eye or -eyeh | [eje] | U+06C0 | ـۀ | | | ۀ | Validity of this form depends on region and dialect. Some may use the two-letter ـه‌ی or ه‌ی combinations instead. |
| lām alef | | [lɒ] | U+0644 (lām) and U+0627 (alef) | ـلا | | | لا | |
| kašida | | | U+0640 | | ـ | | | This is the medial character which connects other characters. |

Although the Persian and Arabic alphabets may seem similar at first glance, the two languages use them in many different ways. For example, similar words are written differently in Persian and Arabic, as they are used differently.

Unicode includes U+262B FARSI SYMBOL in the Miscellaneous Symbols range.[8] In Unicode 1.0 this symbol was known as SYMBOL OF IRAN.[9] It is a stylization of الله (Allah) used as the emblem of Iran. It is also part of the flag of Iran, which is the typical rendering of "🇮🇷", the regional indicator symbol for Iran.

The Unicode Standard defines a compatibility character, U+FDFC RIAL SIGN, that can represent ریال, the Persian name of the currency of Iran.[10]
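The characters mentioned above can be inspected with Python's standard `unicodedata` module. The sketch below looks up the names of U+262B and U+FDFC, applies compatibility normalization (NFKC) to the rial sign, and builds the flag sequence for "IR" from regional indicator symbols. The helper name `regional_indicators` is hypothetical and assumes an uppercase or lowercase ASCII two-letter code; the output depends on the Unicode data shipped with the Python version in use.

```python
import unicodedata

FARSI_SYMBOL = "\u262B"
RIAL_SIGN = "\uFDFC"


def regional_indicators(country_code):
    """Map an ASCII country code such as 'IR' to regional indicator symbols."""
    base = 0x1F1E6  # REGIONAL INDICATOR SYMBOL LETTER A
    return "".join(chr(base + ord(c) - ord("A")) for c in country_code.upper())


print(unicodedata.name(FARSI_SYMBOL))
print(unicodedata.name(RIAL_SIGN))
# NFKC replaces the compatibility character with its decomposition.
print(unicodedata.normalize("NFKC", RIAL_SIGN))
print(regional_indicators("IR"))
```

Whether the two regional indicators are displayed as a flag or as two boxed letters depends on the font and platform.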

Novel letters

The Persian alphabet has four extra letters that are not in the Arabic alphabet: /p/, /t͡ʃ/ (ch in chair), /ʒ/ (s in measure), /ɡ/. An additional fifth letter ڤ was used for /β/ (v in Spanish huevo) but it is no longer used.

| Sound | Shape | Name | Unicode code point |
| --- | --- | --- | --- |
| /p/ | پ | pe | U+067E |
| /t͡ʃ/ (ch) | چ | če | U+0686 |
| /ʒ/ (zh) | ژ | že | U+0698 |
| /ɡ/ | گ | gâf | U+06AF |
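As a small illustration, the four code points in the table can be used to find the Persian-specific letters in a string. The names `PERSIAN_ONLY` and `persian_only_letters` are hypothetical, and the check is only a rough hint: these letters also occur in other languages written with extended Arabic scripts, so their presence does not prove that a text is Persian.

```python
# Code points taken from the table above, mapped to the letter names.
PERSIAN_ONLY = {
    "\u067E": "pe",
    "\u0686": "če",
    "\u0698": "že",
    "\u06AF": "gâf",
}


def persian_only_letters(text):
    """List the names of the four extra letters in order of appearance."""
    return [PERSIAN_ONLY[ch] for ch in text if ch in PERSIAN_ONLY]


# gač ("plaster"): gâf followed by če
print(persian_only_letters("\u06AF\u0686"))
```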

Deviations from the Arabic script

Persian uses the Eastern Arabic numerals, but the shapes of the digits 'four' (۴), 'five' (۵), and 'six' (۶) are different from the shapes used in Arabic. All the digits also have different codepoints in Unicode:[11]

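| Name | Persian | Unicode | Arabic | Unicode |
| --- | --- | --- | --- | --- |
| 0 | ۰ | U+06F0 | ٠ | U+0660 |
| 1 | ۱ | U+06F1 | ١ | U+0661 |
| 2 | ۲ | U+06F2 | ٢ | U+0662 |
| 3 | ۳ | U+06F3 | ٣ | U+0663 |
| 4 | ۴ | U+06F4 | ٤ | U+0664 |
| 5 | ۵ | U+06F5 | ٥ | U+0665 |
| 6 | ۶ | U+06F6 | ٦ | U+0666 |
| 7 | ۷ | U+06F7 | ٧ | U+0667 |
| 8 | ۸ | U+06F8 | ٨ | U+0668 |
| 9 | ۹ | U+06F9 | ٩ | U+0669 |
| ye | ی | U+06CC | ي [a] | U+064A |
| kāf | ک | U+06A9 | ك | U+0643 |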
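Since the Persian and Arabic variants have different code points, text typed on an Arabic keyboard layout will not match the same text typed on a Persian layout unless it is normalized first. The following sketch builds a translation table directly from the code points in the table above; the names `ARABIC_TO_PERSIAN` and `to_persian` are hypothetical. Replacing every ي and ك is an assumption that only suits text known to be Persian.

```python
# Arabic-Indic digits U+0660..U+0669 -> Persian digits U+06F0..U+06F9
ARABIC_TO_PERSIAN = {0x0660 + i: 0x06F0 + i for i in range(10)}
# Arabic yeh and kaf -> Persian ye and kâf
ARABIC_TO_PERSIAN.update({0x064A: 0x06CC, 0x0643: 0x06A9})


def to_persian(text):
    """Replace Arabic code points with their Persian counterparts."""
    return text.translate(ARABIC_TO_PERSIAN)


arabic_digits = "\u0664\u0665\u0666"   # 456 in Arabic-Indic digits
persian_digits = to_persian(arabic_digits)
print(persian_digits)

# Python's int() accepts any Unicode decimal digits, including both sets.
print(int(arabic_digits), int(persian_digits))
```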

Comparison of different numerals

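| Western Arabic | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Eastern Arabic[a] | ٠ | ١ | ٢ | ٣ | ٤ | ٥ | ٦ | ٧ | ٨ | ٩ | ١٠ |
| Persian[b] | ۰ | ۱ | ۲ | ۳ | ۴ | ۵ | ۶ | ۷ | ۸ | ۹ | ۱۰ |
| Urdu[c] | ۰ | ۱ | ۲ | ۳ | ۴ | ۵ | ۶ | ۷ | ۸ | ۹ | ۱۰ |
| Abjad numerals | | ا | ب | ج | د | ه | و | ز | ح | ط | ي |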

Word boundaries

Typically, words are separated from each other by a space. Certain morphemes (such as the plural ending '-hâ'), however, are written without a space. On a computer, they are separated from the word using the zero-width non-joiner.
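In encoded text, the zero-width non-joiner is U+200C. The sketch below attaches the plural ending -hâ (ها) to a noun with a zero-width non-joiner between them; the helper name `add_plural` is hypothetical and the function does no morphological checking of any kind.

```python
ZWNJ = "\u200C"          # ZERO WIDTH NON-JOINER
PLURAL_HA = "\u0647\u0627"  # the plural ending -hâ


def add_plural(noun):
    """Attach -hâ without a space and without joining it to the noun."""
    return noun + ZWNJ + PLURAL_HA


ketab = "\u06A9\u062A\u0627\u0628"  # ketâb ("book")
plural = add_plural(ketab)
print(plural)
print([f"U+{ord(ch):04X}" for ch in plural])
```

The result contains no space character, so software that splits on whitespace treats the noun and its ending as a single word.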

Cyrillic Persian alphabet in Tajikistan

As part of the Russification of Central Asia, the Cyrillic script was introduced in the late 1930s.[12][13][14][15] The alphabet remained Cyrillic until the end of the 1980s with the disintegration of the Soviet Union. In 1989, with the growth in Tajik nationalism, a law was enacted declaring Tajik the state language. In addition, the law officially equated Tajik with Persian, placing the word Farsi (the endonym for the Persian language) after Tajik. The law also called for a gradual reintroduction of the Perso-Arabic alphabet.[16][17][18][19][20][21][22][23][24][25][26][27]

The Persian alphabet was introduced into education and public life, although the banning of the Islamic Renaissance Party in 1993 slowed adoption. In 1999, the word Farsi was removed from the state-language law, reverting the name to simply Tajik.[1] As of 2004 the de facto standard in use is the Tajik Cyrillic alphabet,[2] and as of 1996 only a very small part of the population can read the Persian alphabet.[3]
