Skip to content

Latest commit

 

History

History
245 lines (219 loc) · 17.7 KB

character-tables-hebrew.md

File metadata and controls

245 lines (219 loc) · 17.7 KB

Hebrew character tables

This document lists the per-character shaping information needed to shape Hebrew text.

Table of Contents

Separate character tables are provided for the Hebrew block, the Hebrew letters included in the Alphabetic Presentation Forms block, and for other miscellaneous characters that are used in <hebr> text runs:

The tables list each codepoint along with its Unicode general category. For marks, the table lists the codepoint's mark combining class. The codepoint's Unicode name and an example glyph are also provided.

Codepoints with no assigned meaning are designated as unassigned in the Unicode category column.

Hebrew character table

Codepoint Unicode category Mark class Glyph
U+0590 unassigned
U+0591 Mark [Mn] 220 ֑ Accent Etnahta
U+0592 Mark [Mn] 230 ֒ Accent Segol
U+0593 Mark [Mn] 230 ֓ Accent Shalshelet
U+0594 Mark [Mn] 230 ֔ Accent Zaqef Qatan
U+0595 Mark [Mn] 230 ֕ Accent Zaqef Gadol
U+0596 Mark [Mn] 220 ֖ Accent Tipeha
U+0597 Mark [Mn] 230 ֗ Accent Revia
U+0598 Mark [Mn] 230 ֘ Accent Zarqa
U+0599 Mark [Mn] 230 ֙ Accent Pashta
U+059A Mark [Mn] 222 ֚ Accent Yetiv
U+059B Mark [Mn] 220 ֛ Accent Tevir
U+059C Mark [Mn] 230 ֜ Accent Geresh
U+059D Mark [Mn] 230 ֝ Accent Geresh Muqdam
U+059E Mark [Mn] 230 ֞ Accent Gershayim
U+059F Mark [Mn] 230 ֟ Accent Qarney Para
U+05A0 Mark [Mn] 230 ֠ Accent Telisha Gedola
U+05A1 Mark [Mn] 230 ֡ Accent Pazer
U+05A2 Mark [Mn] 220 ֢ Accent Atnah Hafukh
U+05A3 Mark [Mn] 220 ֣ Accent Munah
U+05A4 Mark [Mn] 220 ֤ Accent Mahapakh
U+05A5 Mark [Mn] 220 ֥ Accent Merkha
U+05A6 Mark [Mn] 220 ֦ Accent Merkha Kefula
U+05A7 Mark [Mn] 220 ֧ Accent Darga
U+05A8 Mark [Mn] 230 ֨ Accent Qadma
U+05A9 Mark [Mn] 230 ֩ Accent Telisha Qetana
U+05AA Mark [Mn] 220 ֪ Accent Yerah Ben Yomo
U+05AB Mark [Mn] 230 ֫ Accent Ole
U+05AC Mark [Mn] 230 ֬ Accent Iluy
U+05AD Mark [Mn] 222 ֭ Accent Dehi
U+05AE Mark [Mn] 228 ֮ Accent Zinor
U+05AF Mark [Mn] 230 ֯ Mark Masora Circle
U+05B0 Mark [Mn] 10 ְ Point Sheva
U+05B1 Mark [Mn] 11 ֱ Point Hataf Segol
U+05B2 Mark [Mn] 12 ֲ Point Hataf Patah
U+05B3 Mark [Mn] 13 ֳ Point Hataf Qamats
U+05B4 Mark [Mn] 14 ִ Point Hiriq
U+05B5 Mark [Mn] 15 ֵ Point Tsere
U+05B6 Mark [Mn] 16 ֶ Point Segol
U+05B7 Mark [Mn] 17 ַ Point Patah
U+05B8 Mark [Mn] 18 ָ Point Qamats
U+05B9 Mark [Mn] 19 ֹ Point Holam
U+05BA Mark [Mn] 19 ֺ Point Holam Haser For Vav
U+05BB Mark [Mn] 20 ֻ Point Qubuts
U+05BC Mark [Mn] 21 ּ Point Dagesh Or Mapiq
U+05BD Mark [Mn] 22 ֽ Point Meteg
U+05BE Punctuation Dash 0 ־ Punctuation Maqaf
U+05BF Mark [Mn] 23 ֿ Point Rafe
U+05C0 Punctuation 0 ׀ Punctuation Paseq
U+05C1 Mark [Mn] 24 ׁ Point Shin Dot
U+05C2 Mark [Mn] 25 ׂ Point Sin Dot
U+05C3 Punctuation 0 ׃ Punctuation Sof Pasuq
U+05C4 Mark [Mn] 230 ׄ Mark Upper Dot
U+05C5 Mark [Mn] 220 ׅ Mark Lower Dot
U+05C6 Punctuation 0 ׆ Punctuation Nun Hafuka
U+05C7 Mark [Mn] 18 ׇ Point Qamats Qatan
U+05C8 unassigned
U+05C9 unassigned
U+05CA unassigned
U+05CB unassigned
U+05CC unassigned
U+05CD unassigned
U+05CE unassigned
U+05CF unassigned
U+05D0 Letter 0 א Alef
U+05D1 Letter 0 ב Bet
U+05D2 Letter 0 ג Gimel
U+05D3 Letter 0 ד Dalet
U+05D4 Letter 0 ה He
U+05D5 Letter 0 ו Vav
U+05D6 Letter 0 ז Zayin
U+05D7 Letter 0 ח Het
U+05D8 Letter 0 ט Tet
U+05D9 Letter 0 י Yod
U+05DA Letter 0 ך Final Kaf
U+05DB Letter 0 כ Kaf
U+05DC Letter 0 ל Lamed
U+05DD Letter 0 ם Final Mem
U+05DE Letter 0 מ Mem
U+05DF Letter 0 ן Final Nun
U+05E0 Letter 0 נ Nun
U+05E1 Letter 0 ס Samekh
U+05E2 Letter 0 ע Ayin
U+05E3 Letter 0 ף Final Pe
U+05E4 Letter 0 פ Pe
U+05E5 Letter 0 ץ Final Tsadi
U+05E6 Letter 0 צ Tsadi
U+05E7 Letter 0 ק Qof
U+05E8 Letter 0 ר Resh
U+05E9 Letter 0 ש Shin
U+05EA Letter 0 ת Tav
U+05EB unassigned
U+05EC unassigned
U+05ED unassigned
U+05EE unassigned
U+05EF Letter 0 ׯ Yod Triangle
U+05F0 Letter 0 װ Ligature Yiddish Double Vav
U+05F1 Letter 0 ױ Ligature Yiddish Vav Yod
U+05F2 Letter 0 ײ Ligature Yiddish Double Yod
U+05F3 Punctuation 0 ׳ Punctuation Geresh
U+05F4 Punctuation 0 ״ Punctuation Gershayim
U+05F5 unassigned
U+05F6 unassigned
U+05F7 unassigned
U+05F8 unassigned
U+05F9 unassigned
U+05FA unassigned
U+05FB unassigned
U+05FC unassigned
U+05FD unassigned
U+05FE unassigned
U+05FF unassigned

Alphabetic Presentation Forms character table

This chart includes only the Hebrew codepoints from the Alphabetic Presentation Forms block in Unicode.

The Composition column lists the codepoints from the Hebrew block that compose into the listed Alphabetic Presentation Form. These presentation form compositions are not covered by the standard Unicode composition algorithm.

Entries with a null in this column do not need to be composed by the shaping engine.

Codepoint Unicode category Mark class Composition Glyph
U+FB1D Letter 0 U+05D9,U+05B4 יִ Yod With Hiriq
U+FB1E Mark [Mn] 26 null ﬞ Point Juedo-Spanish Varika
U+FB1F Letter 0 U+05F2,U+05B7 ײַ Ligature Yiddish Yod Yod Patah
U+FB20 Letter 0 null ﬠ Alternative Ayin
U+FB21 Letter 0 null ﬡ Wide Alef
U+FB22 Letter 0 null ﬢ Wide Dalet
U+FB23 Letter 0 null ﬣ Wide He
U+FB24 Letter 0 null ﬤ Wide Kaf
U+FB25 Letter 0 null ﬥ Wide Lamed
U+FB26 Letter 0 null ﬦ Wide Final Mem
U+FB27 Letter 0 null ﬧ Wide Resh
U+FB28 Letter 0 null ﬨ Wide Tav
U+FB29 Letter 0 null ﬩ Alternative Plus Sign
U+FB2A Letter 0 U+05E9,U+05C1 שׁ Shin With Shin Dot
U+FB2B Letter 0 U+05E9,U+05C2 שׂ Shin With Sin Dot
U+FB2C Letter 0 U+FB2A,U+05BC OR U+FB49,U+05C1 שּׁ Shin With Dagesh And Shin Dot
U+FB2D Letter 0 U+FB2B,U+05BC OR U+FB49,U+05C2 שּׂ Shin With Dagesh And Sin Dot
U+FB2E Letter 0 U+05D0,U+05B7 אַ Alef With Patah
U+FB2F Letter 0 U+05D0,U+05B8 אָ Alef With Qamats
U+FB30 Letter 0 U+05D0,U+05BC אּ Alef With Mapiq
U+FB31 Letter 0 U+05D1,U+05BC בּ Bet With Dagesh
U+FB32 Letter 0 U+05D2,U+05BC גּ Gimel With Dagesh
U+FB33 Letter 0 U+05D3,U+05BC דּ Dalet With Dagesh
U+FB34 Letter 0 U+05D4,U+05BC הּ He With Mapiq
U+FB35 Letter 0 U+05D5,U+05BC וּ Vav With Dagesh
U+FB36 Letter 0 U+05D6,U+05BC זּ Zayin With Dagesh
U+FB37 unassigned
U+FB38 Letter 0 U+05D8,U+05BC טּ Tet With Dagesh
U+FB39 Letter 0 U+05D9,U+05BC יּ Yod With Dagesh
U+FB3A Letter 0 U+05DA,U+05BC ךּ Final Kaf With Dagesh
U+FB3B Letter 0 U+05DB,U+05BC כּ Kaf With Dagesh
U+FB3C Letter 0 U+05DC,U+05BC לּ Lamed With Dagesh
U+FB3D unassigned
U+FB3E Letter 0 U+05DE,U+05BC מּ Mem With Dagesh
U+FB3F unassigned
U+FB40 Letter 0 U+05E0,U+05BC נּ Nun With Dagesh
U+FB41 Letter 0 U+05E1,U+05BC סּ Samekh With Dagesh
U+FB42 unassigned
U+FB43 Letter 0 U+05E3,U+05BC ףּ Final Pe With Dagesh
U+FB44 Letter 0 U+05E4,U+05BC פּ Pe With Dagesh
U+FB45 unassigned
U+FB46 Letter 0 U+05E6,U+05BC צּ Tsadi With Dagesh
U+FB47 Letter 0 U+05E7,U+05BC קּ Qof With Dagesh
U+FB48 Letter 0 U+05E8,U+05BC רּ Resh With Dagesh
U+FB49 Letter 0 U+05E9,U+05BC שּ Shin With Dagesh
U+FB4A Letter 0 U+05EA,U+05BC תּ Tav With Dagesh
U+FB4B Letter 0 U+05D5,U+05B9 וֹ Vav With Holam
U+FB4C Letter 0 U+05D1,U+05BF בֿ Bet With Rafe
U+FB4D Letter 0 U+05DB,U+05BF כֿ Kaf With Rafe
U+FB4E Letter 0 U+05E4,U+05BF פֿ Pe With Rafe
U+FB4F Letter 0 null ﭏ Ligature Alef Lamed

Miscellaneous character table

Other important characters that may be encountered when shaping runs of Hebrew text include the dotted-circle placeholder (U+25CC), the zero-width joiner (U+200D), and zero-width non-joiner (U+200C).

The dotted-circle placeholder is frequently used when displaying a mark in isolation. Real-world text may also use other characters, such as hyphens or dashes, in a similar placeholder fashion; shaping engines should cope with this situation gracefully.

Codepoint Unicode category Mark class Glyph
U+00A0 Separator 0   No-break space
U+034F Other 0 ͏ Combining grapheme joiner
U+200C Other 0 ‌ Zero-width non-joiner
U+200D Other 0 ‍ Zero-width joiner
U+200E Other 0 ‎ Left-to-Right marker
U+200F Other 0 ‏ Right-to-Left marker
U+25CC Symbol 0 ◌ Dotted circle