Script (Unicode)

ழ்
ع‎‎ ש‎‎ ДA‎

In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems.[1] Some scripts support one and only one writing system and language, for example, Armenian. Other scripts support many different writing systems; for example, the Latin script supports English, French, German, Italian, Vietnamese, Latin itself, and several other languages. Some languages make use of multiple alternate writing systems and thus also use several scripts; for example, in Turkish, the Arabic script was used before the 20th century but transitioned to Latin in the early part of the 20th century. More or less complementary to scripts are symbols and Unicode control characters.

The unified diacritical characters and unified punctuation characters frequently have the "common" or "inherited" script property. However, the individual scripts often have their own punctuation and diacritics, so that many scripts include not only letters but also diacritic and other marks, punctuation, numerals and even their own idiosyncratic symbols and space characters.

Unicode 15.1 defines 161 separate scripts, including 94 modern scripts and 67 ancient or historic scripts.[2][3] More scripts are in the process for encoding or have been tentatively allocated for encoding in roadmaps.[4]

Definition and classification edit

When multiple languages make use of the same script, there are frequently some differences, particularly in diacritics and other marks. For example, Swedish and English both use the Latin script. However, Swedish includes the character å (sometimes called a Swedish O), while English has no such character. Nor does English make use of the diacritic combining ring above for any character. In general, the languages sharing the same scripts share many of the same characters. Despite these peripheral differences in the Swedish and English writing systems, they are said to use the same Latin script. Thus, the Unicode abstraction of scripts is a basic organizing technique. The differences among different alphabets or writing systems remain and are supported through Unicode’s flexible scripts, combining marks and collation algorithms.

Script versus writing system edit

Writing system is sometimes treated as a synonym for "script". However, it also can be used as the specific concrete writing system supported by a script. For example, the Vietnamese writing system is supported by the Latin script. A writing system may also cover more than one script; for example, the Japanese writing system makes use of the Han, Hiragana and Katakana scripts.

Most writing systems can be broadly divided into several categories: logographic, syllabic, alphabetic (or segmental), abugida, abjad and featural; however, all features of any of these may be found in any given writing system in varying proportions, often making it difficult to purely categorize a system. The term complex system is sometimes used to describe those where the admixture makes classification problematic.

Unicode supports all of these types of writing systems through its numerous scripts. Unicode also adds further properties to characters to help differentiate the various characters and the ways they behave within Unicode text-processing algorithms.

Special script property values edit

In addition to explicit or specific script properties, Unicode uses three special values:[5]

Common
Unicode can assign a character in the UCS to a single script only. However, many characters—those that are not part of a formal natural-language writing system or are unified across many writing systems—may be used in more than one script (for example, currency signs, symbols, numerals and punctuation marks). In these cases Unicode defines them as belonging to the "common" script (ISO 15924 code "Zyyy").
Inherited
Many diacritics and non-spacing combining characters may be applied to characters from more than one script. In these cases Unicode assigns them to the "inherited" script (ISO 15924 code Zinh), which means that they have the same script class as the base character with which they combine, and so in different contexts they may be treated as belonging to different scripts. For example, U+0308  ̈  COMBINING DIAERESIS may combine either with U+0065 e LATIN SMALL LETTER E to create a Latin ë or with U+0435 е CYRILLIC SMALL LETTER IE for the Cyrillic ё. In the former case, it inherits the Latin script of the base character, whereas in the latter case, it inherits the Cyrillic script of the base character.
Unknown
The value of "unknown" script (ISO 15924 code Zzzz) is given to unassigned, private-use, noncharacter, and surrogate code points.

Character categories within scripts edit

Unicode provides a general category property for each character. So in addition to belonging to a script every character also has a general category. Typically scripts include letter characters including: uppercase letters, lowercase letter and modifier letters. Some characters are considered titlecase letters for a few precomposed ligatures such as Dz (U+01F2). Such titlecase ligatures are all in the Latin and Greek scripts and are all compatibility characters, and therefore Unicode discourages their use by authors. It is unlikely that new titlecase letters will be added in the future.

Most writing systems do not differentiate between uppercase and lowercase letters. For those scripts all letters are categorized as "other letter" or "modifier letter". Ideographs such as Unihan ideographs are also categorized as "other letters". A few scripts do differentiate between uppercase and lowercase however: Latin, Cyrillic, Greek, Armenian, Georgian, and Deseret. Even for these scripts there are some letters that are neither uppercase nor lowercase.

Scripts can also contain any other general category character such as marks (diacritic and otherwise), numbers (numerals), punctuation, separators (word separators such as spaces), symbols and non-graphical format characters. These are included in a particular script when they are unique to that script. Other such characters are generally unified and included in the punctuation or diacritic blocks. However, the bulk of characters in any script (other than the common and inherited scripts) are letters.

List of scripts in Unicode edit

Unicode defines over a hundred script names (called "Alias" or "Property value alias"), based on the ISO 15924 list.Unicode uses the "Common" script name for ISO 15924's Zyyy (code for undetermined script), "Inherited" for ISO 15924's Zinh (code for inherited script), and "Unknown" for ISO 15924's Zzzz (code for uncoded script). Not used are, among others, the ISO 15924 script codes: Zsym (Symbols) and Zmth (Mathematical notation). These are considered not to be scripts in Unicode sense.

ISO 15924Script in Unicode[e]
CodeISO numberISO formal nameDirectionalityUnicode Alias[f]VersionCharactersNotesDescription
Adlm166Adlamright-to-left script  Adlam9.088Ch 19.9
Afak439AfakavariesZZ— Not in Unicode, proposal is explored[i]
Aghb239Caucasian Albanianleft-to-right  Caucasian Albanian7.053Ancient/historicCh 8.11
Ahom338Ahom, Tai Ahomleft-to-right  Ahom8.065Ancient/historicCh 15.16
Arab160Arabicright-to-left script  Arabic1.01,368Ch 9.2
Aran161Arabic (Nastaliq variant)mixedZZ— Typographic variant of Arabic (see § Arab)
Armi124Imperial Aramaicright-to-left script  Imperial Aramaic5.231Ancient/historicCh 10.4
Armn230Armenianleft-to-right  Armenian1.096Ch 7.6
Avst134Avestanright-to-left script  Avestan5.261Ancient/historicCh 10.7
Bali360Balineseleft-to-right  Balinese5.0124Ch 17.3
Bamu435Bamumleft-to-right  Bamum5.2657Ch 19.6
Bass259Bassa Vahleft-to-right  Bassa Vah7.036Ancient/historicCh 19.7
Batk365Batakleft-to-right  Batak6.056Ch 17.6
Beng325Bengali (Bangla)left-to-right  Bengali1.096Ch 12.2
Bhks334Bhaiksukileft-to-right  Bhaiksuki9.097Ancient/historicCh 14.3
Blis550BlissymbolsvariesZZ— Not in Unicode, proposal is explored[i]
Bopo285Bopomofoleft-to-right, right-to-left script  Bopomofo1.077Ch 18.3
Brah300Brahmileft-to-right  Brahmi6.0115Ancient/historicCh 14.1
Brai570Brailleleft-to-right  Braille3.0256Ch 21.1
Bugi367Bugineseleft-to-right  Buginese4.130Ch 17.2
Buhd372Buhidleft-to-right  Buhid3.220Ch 17.1
Cakm349Chakmaleft-to-right  Chakma6.171Ch 13.11
Cans440Unified Canadian Aboriginal Syllabicsleft-to-right  Canadian Aboriginal3.0726Ch 20.2
Cari201Carianleft-to-right, right-to-left script  Carian5.149Ancient/historicCh 8.5
Cham358Chamleft-to-right  Cham5.183Ch 16.10
Cher445Cherokeeleft-to-right  Cherokee3.0172Ch 20.1
Chis298Chisoileft-to-rightZZ— Not in Unicode, proposal is mature[ii]
Chrs109Chorasmianright-to-left script, top-to-bottom  Chorasmian13.028Ancient/historicCh 10.8
Cirt291CirthvariesZZ— Not in Unicode
Copt204Copticleft-to-right  Coptic1.0137Ancient/historic, disunified from Greek in 4.1Ch 7.3
Cpmn402Cypro-Minoanleft-to-rightCypro Minoan14.099Ancient/historicCh 8.4
Cprt403Cypriot syllabaryright-to-left script  Cypriot4.055Ancient/historicCh 8.3
Cyrl220Cyrillicleft-to-right  Cyrillic1.0506Includes typographic variant Old Church Slavonic (see § Cyrs)Ch 7.4
Cyrs221Cyrillic (Old Church Slavonic variant)variesZZ— Typographic variant of Cyrillic (see § Cyrl); Ancient/historic
Deva315Devanagari (Nagari)left-to-right  Devanagari1.0164Ch 12.1
Diak342Dives Akuruleft-to-right  Dives Akuru13.072Ancient/historicCh 15.15
Dogr328Dograleft-to-right  Dogra11.060Ancient/historicCh 15.18
Dsrt250Deseret (Mormon)left-to-right  Deseret3.180Ch 20.4
Dupl755Duployan shorthand, Duployan stenographyleft-to-right  Duployan7.0143Ch 21.6
Egyd070Egyptian demoticmixedZZ— Not in Unicode
Egyh060Egyptian hieraticmixedZZ— Not in Unicode
Egyp050Egyptian hieroglyphsright-to-left script, left-to-right  Egyptian Hieroglyphs5.21,110Ancient/historicCh 11.4
Elba226Elbasanleft-to-right  Elbasan7.040Ancient/historicCh 8.10
Elym128Elymaicright-to-left script  Elymaic12.023Ancient/historicCh 10.9
Ethi430Ethiopic (Geʻez)left-to-right  Ethiopic3.0523Ch 19.1
Gara164Garayright-to-leftZZ— Not in Unicode, approved for version 16.0[iii]
Geok241Khutsuri (Asomtavruli and Nuskhuri)left-to-right  GeorgianUnicode groups Khutsori, Asomtavruli and Nuskhuri into 'Georgian' (see § Geok). Similarly, Mkhedruli and Mtavruli are 'Georgian' (see § Geor)Ch 7.7
Geor240Georgian (Mkhedruli and Mtavruli)left-to-right  Georgian1.0173In Unicode this also includes Nuskhuri (Geok)Ch 7.7
Glag225Glagoliticleft-to-right  Glagolitic4.1134Ancient/historicCh 7.5
Gong312Gunjala Gondileft-to-right  Gunjala Gondi11.063Ch 13.15
Gonm313Masaram Gondileft-to-right  Masaram Gondi10.075Ch 13.14
Goth206Gothicleft-to-right  Gothic3.127Ancient/historicCh 8.9
Gran343Granthaleft-to-right  Grantha7.085Ancient/historicCh 15.14
Grek200Greekleft-to-right  Greek1.0518Directionality sometimes as boustrophedonCh 7.2
Gujr320Gujaratileft-to-right  Gujarati1.091Ch 12.4
Gukh397Gurung Khemaleft-to-rightZZ— Not in Unicode, approved for version 16.0[iii]
Guru310Gurmukhileft-to-right  Gurmukhi1.080Ch 12.3
Hanb503Han with Bopomofo (alias for Han + Bopomofo)mixedZZ— See § Hani, § Bopo
Hang286Hangul (Hangŭl, Hangeul)left-to-right, vertical right-to-left  Hangul1.011,739Hangul syllables relocated in 2.0Ch 18.6
Hani500Han (Hanzi, Kanji, Hanja)top-to-bottom, columns right-to-left (historically)Han1.099,030Ch 18.1
Hano371Hanunoo (Hanunóo)left-to-right, bottom-to-top  Hanunoo3.221Ch 17.1
Hans501Han (Simplified variant)variesZZ— Subset of Han (Hanzi, Kanji, Hanja) (see § Hani)
Hant502Han (Traditional variant)variesZZ— Subset of § Hani
Hatr127Hatranright-to-left script  Hatran8.026Ancient/historicCh 10.12
Hebr125Hebrewright-to-left script  Hebrew1.0134Ch 9.1
Hira410Hiraganavertical right-to-left, left-to-right  Hiragana1.0381Ch 18.4
Hluw080Anatolian Hieroglyphs (Luwian Hieroglyphs, Hittite Hieroglyphs)left-to-right  Anatolian Hieroglyphs8.0583Ancient/historicCh 11.6
Hmng450Pahawh Hmongleft-to-right  Pahawh Hmong7.0127Ch 16.11
Hmnp451Nyiakeng Puachue Hmongleft-to-right  Nyiakeng Puachue Hmong12.071Ch 16.12
Hrkt412Japanese syllabaries (alias for Hiragana + Katakana)vertical right-to-left, left-to-right  Katakana or HiraganaSee § Hira, § KanaCh 18.4
Hung176Old Hungarian (Hungarian Runic)right-to-left script  Old Hungarian8.0108Ancient/historicCh 8.8
Inds610Indus (Harappan)mixedZZ— Not in Unicode, proposal is explored[i]
Ital210Old Italic (Etruscan, Oscan, etc.)right-to-left script, left-to-right  Old Italic3.139Ancient/historicCh 8.6
Jamo284Jamo (alias for Jamo subset of Hangul)variesZZ— Subset of § Hang
Java361Javaneseleft-to-right  Javanese5.290Ch 17.4
Jpan413Japanese (alias for Han + Hiragana + Katakana)variesZZ— See § Hani, § Hira and § Kana
Jurc510Jurchenleft-to-rightZZ— Not in Unicode
Kali357Kayah Lileft-to-right  Kayah Li5.147Ch 16.9
Kana411Katakanavertical right-to-left, left-to-right  Katakana1.0321Ch 18.4
Kawi368Kawileft-to-right  Kawi15.086Ancient/historicCh 17.9
Khar305Kharoshthiright-to-left script  Kharoshthi4.168Ancient/historicCh 14.2
Khmr355Khmerleft-to-right  Khmer3.0146Ch 16.4
Khoj322Khojkileft-to-right  Khojki7.065Ancient/historicCh 15.7
Kitl505Khitan large scriptleft-to-rightZZ— Not in Unicode
Kits288Khitan small scriptvertical right-to-left  Khitan Small Script13.0471Ancient/historicCh 18.12
Knda345Kannadaleft-to-right  Kannada1.091Ch 12.8
Kore287Korean (alias for Hangul + Han)left-to-rightZZ— See § Hani, § Hang
Kpel436Kpelleleft-to-rightZZ— Not in Unicode, proposal is explored[i]
Krai396Kirat Raileft-to-rightZZ— Not in Unicode, approved for version 16.0[iii]
Kthi317Kaithileft-to-right  Kaithi5.268Ancient/historicCh 15.2
Lana351Tai Tham (Lanna)left-to-right  Tai Tham5.2127Ch 16.7
Laoo356Laoleft-to-right  Lao1.083Ch 16.2
Latf217Latin (Fraktur variant)variesZZ— Typographic variant of Latin (see § Latn)
Latg216Latin (Gaelic variant)left-to-rightZZ— Typographic variant of Latin (see § Latn)
Latn215Latinleft-to-right  Latin1.01,481See also: Latin script in UnicodeCh 7.1
Leke364Lekeleft-to-rightZZ— Not in Unicode
Lepc335Lepcha (Róng)left-to-right  Lepcha5.174Ch 13.12
Limb336Limbuleft-to-right  Limbu4.068Ch 13.6
Lina400Linear Aleft-to-right  Linear A7.0341Ancient/historicCh 8.1
Linb401Linear Bleft-to-right  Linear B4.0211Ancient/historicCh 8.2
Lisu399Lisu (Fraser)left-to-right  Lisu5.249Ch 18.9
Loma437Lomaleft-to-rightZZ— Not in Unicode, proposal is explored[i]
Lyci202Lycianleft-to-right  Lycian5.129Ancient/historicCh 8.5
Lydi116Lydianright-to-left script  Lydian5.127Ancient/historicCh 8.5
Mahj314Mahajanileft-to-right  Mahajani7.039Ancient/historicCh 15.6
Maka366Makasarleft-to-right  Makasar11.025Ancient/historicCh 17.8
Mand140Mandaic, Mandaeanright-to-left script  Mandaic6.029Ch 9.5
Mani139Manichaeanright-to-left script  Manichaean7.051Ancient/historicCh 10.5
Marc332Marchenleft-to-right  Marchen9.068Ancient/historicCh 14.5
Maya090Mayan hieroglyphsmixedZZ— Not in Unicode
Medf265Medefaidrin (Oberi Okaime, Oberi Ɔkaimɛ)left-to-right  Medefaidrin11.091Ch 19.10
Mend438Mende Kikakuiright-to-left script  Mende Kikakui7.0213Ch 19.8
Merc101Meroitic Cursiveright-to-left script  Meroitic Cursive6.190Ancient/historicCh 11.5
Mero100Meroitic Hieroglyphsright-to-left script  Meroitic Hieroglyphs6.132Ancient/historicCh 11.5
Mlym347Malayalamleft-to-right  Malayalam1.0118Ch 12.9
Modi324Modi, Moḍīleft-to-right  Modi7.079Ancient/historicCh 15.12
Mong145Mongolianvertical left-to-right, left-to-right  Mongolian3.0168Mong includes Clear and Manchu scriptsCh 13.5
Moon218Moon (Moon code, Moon script, Moon type)mixedZZ— Not in Unicode, proposal is explored[i]
Mroo264Mro, Mruleft-to-right  Mro7.043Ch 13.8
Mtei337Meitei Mayek (Meithei, Meetei)left-to-right  Meetei Mayek5.279Ch 13.7
Mult323Multanileft-to-right  Multani8.038Ancient/historicCh 15.10
Mymr350Myanmar (Burmese)left-to-right  Myanmar3.0223Ch 16.3
Nagm295Nag Mundarileft-to-right  Nag Mundari15.042
Nand311Nandinagarileft-to-right  Nandinagari12.065Ancient/historicCh 15.13
Narb106Old North Arabian (Ancient North Arabian)right-to-left script  Old North Arabian7.032Ancient/historicCh 10.1
Nbat159Nabataeanright-to-left script  Nabataean7.040Ancient/historicCh 10.10
Newa333Newa, Newar, Newari, Nepāla lipileft-to-right  Newa9.097Ch 13.3
Nkdb085Naxi Dongba (na²¹ɕi³³ to³³ba²¹, Nakhi Tomba)left-to-rightZZ— Not in Unicode
Nkgb420Naxi Geba (na²¹ɕi³³ gʌ²¹ba²¹, 'Na-'Khi ²Ggŏ-¹baw, Nakhi Geba)left-to-rightZZ— Not in Unicode, proposal is explored[i]
Nkoo165N’Koright-to-left script  NKo5.062Ch 19.4
Nshu499Nüshuvertical right-to-left  Nushu10.0397Ch 18.8
Ogam212Oghambottom-to-top, left-to-right  Ogham3.029Ancient/historicCh 8.14
Olck261Ol Chiki (Ol Cemet’, Ol, Santali)left-to-right  Ol Chiki5.148Ch 13.10
Onao296Ol Onalleft-to-rightZZ— Not in Unicode, approved for version 16.0[iii]
Orkh175Old Turkic, Orkhon Runicright-to-left script  Old Turkic5.273Ancient/historicCh 14.8
Orya327Oriya (Odia)left-to-right  Oriya1.091Ch 12.5
Osge219Osageleft-to-right  Osage9.072Ch 20.3
Osma260Osmanyaleft-to-right  Osmanya4.040Ch 19.2
Ougr143Old UyghurmixedOld Uyghur14.026Ancient/historicCh 14.11
Palm126Palmyreneright-to-left script  Palmyrene7.032Ancient/historicCh 10.11
Pauc263Pau Cin Hauleft-to-right  Pau Cin Hau7.057Ch 16.13
Pcun015Proto-Cuneiformleft-to-rightZZ— Not in Unicode
Pelm016Proto-Elamiteleft-to-rightZZ— Not in Unicode
Perm227Old Permicleft-to-right  Old Permic7.043Ancient/historicCh 8.13
Phag331Phags-pavertical left-to-right  Phags-pa5.056Ancient/historicCh 14.4
Phli131Inscriptional Pahlaviright-to-left script  Inscriptional Pahlavi5.227Ancient/historicCh 10.6
Phlp132Psalter Pahlaviright-to-left script  Psalter Pahlavi7.029Ancient/historicCh 10.6
Phlv133Book PahlavimixedZZ— Not in Unicode
Phnx115Phoenicianright-to-left script  Phoenician5.029Ancient/historic[g]Ch 10.3
Piqd293Klingon (KLI pIqaD)left-to-right  ZZ— Rejected for inclusion in Unicode[iv][v]
Plrd282Miao (Pollard)left-to-right  Miao6.1149Ch 18.10
Prti130Inscriptional Parthianright-to-left script  Inscriptional Parthian5.230Ancient/historicCh 10.6
Psin103Proto-SinaiticmixedZZ— Not in Unicode
Qaaa-Qabx900-949Reserved for private use (range)ZZ— Not in Unicode
Ranj303Ranjanaleft-to-rightZZ— Not in Unicode
Rjng363Rejang (Redjang, Kaganga)left-to-right  Rejang5.137Ch 17.5
Rohg167Hanifi Rohingyaright-to-left script  Hanifi Rohingya11.050Ch 16.14
Roro620RongorongomixedZZ— Not in Unicode, proposal is explored[i]
Runr211Runicleft-to-right, boustrophedon  Runic3.086Ancient/historicCh 8.7
Samr123Samaritanright-to-left script, top-to-bottom  Samaritan5.261Ch 9.4
Sara292SaratimixedZZ— Not in Unicode
Sarb105Old South Arabianright-to-left script  Old South Arabian5.232Ancient/historicCh 10.2
Saur344Saurashtraleft-to-right  Saurashtra5.182Ch 13.13
Sgnw095SignWritingvertical left-to-right  SignWriting8.0672Ch 21.7
Shaw281Shavian (Shaw)left-to-right  Shavian4.048Ch 8.15
Shrd319Sharada, Śāradāleft-to-right  Sharada6.196Ch 15.3
Shui530Shuishuleft-to-rightZZ— Not in Unicode
Sidd302Siddham, Siddhaṃ, Siddhamātṛkāleft-to-right  Siddham7.092Ancient/historicCh 15.5
Sidt180Sideticright-to-leftZZ— Not in Unicode, proposal is mature[ii]
Sind318Khudawadi, Sindhileft-to-right  Khudawadi7.069Ch 15.9
Sinh348Sinhalaleft-to-right  Sinhala3.0111Ch 13.2
Sogd141Sogdianhorizontal and vertical writing in East Asian scripts, top-to-bottom  Sogdian11.042Ancient/historicCh 14.10
Sogo142Old Sogdianright-to-left script  Old Sogdian11.040Ancient/historicCh 14.9
Sora398Sora Sompengleft-to-right  Sora Sompeng6.135Ch 15.17
Soyo329Soyomboleft-to-right  Soyombo10.083Ancient/historicCh 14.7
Sund362Sundaneseleft-to-right  Sundanese5.172Ch 17.7
Sunu274Sunuwarleft-to-rightZZ— Not in Unicode, approved for version 16.0[iii]
Sylo316Syloti Nagrileft-to-right  Syloti Nagri4.145Ancient/historicCh 15.1
Syrc135Syriacright-to-left script  Syriac3.088Includes typographic variants Estrangelo (see § Syre), Western (§ Syrj), and Eastern (§ Syrn)Ch 9.3
Syre138Syriac (Estrangelo variant)mixedZZ— Typographic variant of Syriac (see § Syrc)
Syrj137Syriac (Western variant)mixedZZ— Typographic variant of Syriac (see § Syrc)
Syrn136Syriac (Eastern variant)mixedZZ— Typographic variant of Syriac (see § Syrc)
Tagb373Tagbanwaleft-to-right  Tagbanwa3.218Ch 17.1
Takr321Takri, Ṭākrī, Ṭāṅkrīleft-to-right  Takri6.168Ch 15.4
Tale353Tai Leleft-to-right  Tai Le4.035Ch 16.5
Talu354New Tai Lueleft-to-right  New Tai Lue4.183Ch 16.6
Taml346Tamilleft-to-right  Tamil1.0123Ch 12.6
Tang520Tangutvertical right-to-left, left-to-right  Tangut9.06,914Ancient/historicCh 18.11
Tavt359Tai Vietleft-to-right  Tai Viet5.272Ch 16.8
Tayo380Tai Yotop-to-bottom, columns right-to-leftZZ— Not in Unicode, proposal is mature[ii]
Telu340Teluguleft-to-right  Telugu1.0100Ch 12.7
Teng290Tengwarleft-to-rightZZ— Not in Unicode
Tfng120Tifinagh (Berber)left-to-right, right-to-left script, top-to-bottom, bottom-to-top  Tifinagh4.159Ch 19.3
Tglg370Tagalog (Baybayin, Alibata)left-to-right  Tagalog3.223Ch 17.1
Thaa170Thaanaright-to-left script  Thaana3.050Ch 13.1
Thai352Thaileft-to-right  Thai1.086Ch 16.1
Tibt330Tibetanleft-to-right  Tibetan2.0207Added in 1.0, removed in 1.1 and reintroduced in 2.0Ch 13.4
Tirh326Tirhutaleft-to-right  Tirhuta7.082Ch 15.11
Tnsa275Tangsaleft-to-rightTangsa14.089Ch 13.18
Todr229Todhriright-to-leftZZ— Not in Unicode, approved for version 16.0[iii]
Tols299Tolong Sikileft-to-rightZZ— Not in Unicode, proposal is mature[ii]
Toto294Totoleft-to-rightToto14.031Ch 13.17
Tutg341Tulu-Tigalarileft-to-rightZZ— Not in Unicode, approved for version 16.0[iii]
Ugar040Ugariticleft-to-right  Ugaritic4.031Ancient/historicCh 11.2
Vaii470Vaileft-to-right  Vai5.1300Ch 19.5
Visp280Visible Speechleft-to-rightZZ— Not in Unicode
Vith228Vithkuqileft-to-rightVithkuqi14.070Ancient/historicCh 8.12
Wara262Warang Citi (Varang Kshiti)left-to-right  Warang Citi7.084Ch 13.9
Wcho283Wancholeft-to-right  Wancho12.059Ch 13.16
Wole480WoleaimixedZZ— Not in Unicode, proposal is explored[i]
Xpeo030Old Persianleft-to-right  Old Persian4.150Ancient/historicCh 11.3
Xsux020Cuneiform, Sumero-Akkadianleft-to-right  Cuneiform5.01,234Ancient/historicCh 11.1
Yezi192Yezidiright-to-left script  Yezidi13.047Ancient/historicCh 9.6
Yiii460Yileft-to-right  Yi3.01,220Ch 18.7
Zanb339Zanabazar Square (Zanabazarin Dörböljin Useg, Xewtee Dörböljin Bicig, Horizontal Square Script)left-to-right  Zanabazar Square10.072Ancient/historicCh 14.6
Zinh994Code for inherited scriptInherited657
Zmth995Mathematical notationZZ— Not a 'script' in Unicode
Zsym996SymbolsZZ— Not a 'script' in Unicode
Zsye993Symbols (emoji variant)ZZ— Not a 'script' in Unicode
Zxxx997Code for unwritten documentsZZ— Not a 'script' in Unicode
Zyyy998Code for undetermined scriptCommon8,306
Zzzz999Code for uncoded scriptUnknown964,234In Unicode: All other code points
Notes
  1. ^
    ISO 15924 publications As of 12 September 2023
  2. ^
    ISO 15924 Normative text file As of 12 September 2023
  3. ^
    ISO 15924 Changes (including Aliases for Unicode; as of 12 September 2023)
  4. ^
    Unicode version 15.1
  5. ^
  6. ^
    Unicode uses the "Property Value Alias" (Alias) as the script-name. These Alias names are part of Unicode and are published informatively next to ISO 15924. An alias script name may be used in a character name: Palm, Palmyrene → U+10860 𐡠 PALMYRENE LETTER ALEPH.
  7. ^
    In Unicode, the Phoenician script is intended for the representation of text in Paleo-Hebrew, Archaic Phoenician, Phoenician, Early Aramaic, Late Phoenician cursive, Phoenician papyri, Siloam Hebrew, Hebrew seals, Ammonite, Moabite, and Punic.[vi]
References
  1. ^ a b c d e f g h i "SEI List of Scripts Not Yet Encoded". Unicode Consortium. March 2023. Retrieved 2023-09-25.
  2. ^ a b c d "Unicode Pipeline § Code Points Provisionally Assigned for Mature Proposals". Unicode Consortium. 2023-09-12. Retrieved 2023-09-25.
  3. ^ a b c d e f g "Unicode Pipeline § Approved for Publication in Version 16.0". Unicode Consortium. 2023-09-12. Retrieved 2023-09-25.
  4. ^ Michael Everson (1997-09-18). "Proposal to encode Klingon in Plane 1 of ISO/IEC 10646-2".
  5. ^ The Unicode Consortium (2001-08-14). "Approved Minutes of the UTC 87 / L2 184 Joint Meeting".
  6. ^ "Middle East-II, Ancient Scripts" (PDF). 15.0.0. The Unicode Consortium. Retrieved 2023-09-25.

Missing scripts in Unicode edit

With each new version of Unicode, new writing systems are added to the international character code. According to a statement by linguist Dr Deborah Anderson of UC Berkeley, there are over 100 writing systems that have not yet been included in Unicode.

According to a list of the project Missing Scripts by the University of Applied Sciences Mainz, Germany, the ANRT Nancy, France and UC Berkeley, USA, there are 294 known writing systems of mankind according to the current state of research (January 2022). 131 of them have not yet been encoded in Unicode, i.e. cannot yet be used on a computer or mobile phone.

See also edit

References edit

  1. ^ "Glossary". unicode.org.
  2. ^ "Unicode Character Database: Scripts". unicode.org.
  3. ^ "Chapter 14: Additional Ancient and Historic Scripts". The Unicode Standard, Version 15.0 (PDF). Mountain View, CA: Unicode, Inc. September 2022. ISBN 978-1-936213-32-0.
  4. ^ https://www.unicode.org/roadmaps/ Roadmaps to Unicode
  5. ^ "UAX #24: Unicode Script Property". www.unicode.org.

External links edit