Cyrillic script in Unicode

  • the cyrillic script
    slavic letters
    АБВГҐДЂ
    ЃЕЀЁЄЖЗ
    З́ЅИЍІЇЙ
    ЈКЛЉМНЊ
    ОПРСС́ТЋ
    ЌУЎФХЦЧ
    ЏШЩЪЫЬЭ
    ЮЯ
    non-slavic letters
    А́ А̀ӐА̄А̊А̃Ӓ
    Ӓ̄ В̌ӘӘ́Ә̃ӚӔ
    ҒГ̧Г̑Г̄ Г̣ Г̌Ҕ
    Ӻ Ғ̌ӶԀԂ
    Д̆ Д̣ԪԬД̆Ӗ
    Е̄Е̃Ё̄Є̈ӁҖ
    ӜԄҘӞ З̌ З̱ З̣
    ԐԐ̈ӠԆӢИ̃Ҋ
    Ӥ И́ҚӃҠҞҜ
    Ԟ К̣ԚӅԮԒԠ
    ԈԔӍӉҢԨӇ
    ҤԢԊ О́ О̀О̆ О̂
    О̃О̄Ӧ Ӧ̄ӨӨ̄ Ө́
    Ө̆ӪҨԤҦР̌Ҏ
    ԖҪ С̣ С̱Ԍ Т̌ Т̣
    ҬԎУ̃Ӯ
    Ӱ ӲҮҮ́Ұ
    Х̑ҲӼӾҺ
    ԦҴҶӴ
    ӋҸҼҾ
    Ы̆Ы̄ӸҌЭ̆Э̄Э̇
    ӬӬ́Ӭ̄Ю̆Ю̈ Ю̄
    Я̆Я̄Я̈ԘԜӀ
    archaic letters
    ҀѺ
    ѸѠѼѾ
    ѢѤѦ
    ѪѨѬѮ
    ѰѲѴѶ
    • list of cyrillic letters
    • cyrillic digraphs

    as of unicode version 13.0 cyrillic script is encoded across several blocks, all in the bmp:

    • u+0400–u+04ff, 256 characters
    • u+0500–u+052f, 48 characters
    • u+2de0–u+2dff, 32 characters
    • u+a640–u+a69f, 96 characters
    • u+1c80–u+1c8f, 9 characters
    • u+1d2b, u+1d78, 2 cyrillic characters
    • u+fe2e–u+fe2f, 2 cyrillic characters

    the characters in the range u+0400–u+045f are basically the characters from iso 8859-5 moved upward by 864 positions. the next characters in the cyrillic block, range u+0460–u+0489, are historical letters, some being still used for church slavonic. the characters in the range u+048a–u+04ff and the complete cyrillic supplement block (u+0500-u+052f) are additional letters for various languages that are written with cyrillic script. two characters in the block phonetic extensions block complete the uralic phonetic alphabet: u+1d2b cyrillic letter small capital el and u+1d78 modifier letter cyrillic en.

    unicode includes few precomposed accented cyrillic letters; the others can be combined by adding u+0301 ("combining acute accent") after the accented vowel (e.g., ы́ э́ ю́ я́) (see below).

    the following two diacritical marks not specific to cyrillic can be used with cyrillic text:

    • u+0301 ◌́ combining acute accent (= cyrillic stress mark), in u+0300–u+036f
    • u+20dd ◌⃝ combining enclosing circle (= cyrillic ten thousands sign), in u+20d0–u+20f0

    in the table below, small letters are ordered according to their unicode numbers; capital letters are placed immediately before the corresponding small letters. standard unicode names and canonical decompositions are included.

  • table of characters
  • blocks
  • see also
  • references
  • further reading

The Cyrillic script
Slavic letters
АБВГҐДЂ
ЃЕЀЁЄЖЗ
З́ЅИЍІЇЙ
ЈКЛЉМНЊ
ОПРСС́ТЋ
ЌУЎФХЦЧ
ЏШЩЪЫЬЭ
ЮЯ
Non-Slavic letters
А́ А̀ӐА̄А̊А̃Ӓ
Ӓ̄ В̌ӘӘ́Ә̃ӚӔ
ҒГ̧Г̑Г̄ Г̣ Г̌Ҕ
Ӻ Ғ̌ӶԀԂ
Д̆ Д̣ԪԬД̆Ӗ
Е̄Е̃Ё̄Є̈ӁҖ
ӜԄҘӞ З̌ З̱ З̣
ԐԐ̈ӠԆӢИ̃Ҋ
Ӥ И́ҚӃҠҞҜ
Ԟ К̣ԚӅԮԒԠ
ԈԔӍӉҢԨӇ
ҤԢԊ О́ О̀О̆ О̂
О̃О̄Ӧ Ӧ̄ӨӨ̄ Ө́
Ө̆ӪҨԤҦР̌Ҏ
ԖҪ С̣ С̱Ԍ Т̌ Т̣
ҬԎУ̃Ӯ
Ӱ ӲҮҮ́Ұ
Х̑ҲӼӾҺ
ԦҴҶӴ
ӋҸҼҾ
Ы̆Ы̄ӸҌЭ̆Э̄Э̇
ӬӬ́Ӭ̄Ю̆Ю̈ Ю̄
Я̆Я̄Я̈ԘԜӀ
Archaic letters
ҀѺ
ѸѠѼѾ
ѢѤѦ
ѪѨѬѮ
ѰѲѴѶ

As of Unicode version 13.0 Cyrillic script is encoded across several blocks, all in the BMP:

The characters in the range U+0400–U+045F are basically the characters from ISO 8859-5 moved upward by 864 positions. The next characters in the Cyrillic block, range U+0460–U+0489, are historical letters, some being still used for Church Slavonic. The characters in the range U+048A–U+04FF and the complete Cyrillic Supplement block (U+0500-U+052F) are additional letters for various languages that are written with Cyrillic script. Two characters in the block Phonetic Extensions block complete the Uralic Phonetic Alphabet: U+1D2B CYRILLIC LETTER SMALL CAPITAL EL and U+1D78 MODIFIER LETTER CYRILLIC EN.

Unicode includes few precomposed accented Cyrillic letters; the others can be combined by adding U+0301 ("combining acute accent") after the accented vowel (e.g., ы́ э́ ю́ я́) (see below).

The following two diacritical marks not specific to Cyrillic can be used with Cyrillic text:

  • U+0301 ◌́ COMBINING ACUTE ACCENT (= Cyrillic stress mark), in U+0300–U+036F
  • U+20DD ◌⃝ COMBINING ENCLOSING CIRCLE (= Cyrillic ten thousands sign), in U+20D0–U+20F0

In the table below, small letters are ordered according to their Unicode numbers; capital letters are placed immediately before the corresponding small letters. Standard Unicode names and canonical decompositions are included.