| Lu | L · Letter | Letter, uppercase | Uppercase letters in cased scripts. | A B C Ω Ж |
| Ll | L · Letter | Letter, lowercase | Lowercase letters in cased scripts. | a b c α ж |
| Lt | L · Letter | Letter, titlecase | The 31 Croatian/Serbian Latin digraphs and a handful of archaic forms used for titlecasing. | Dž Lj Nj Dz |
| Lm | L · Letter | Letter, modifier | Small letters that modify the preceding base character. | ʰ ʱ ʲ ˇ ˆ |
| Lo | L · Letter | Letter, other | Letters in scripts with no case distinction — CJK, Arabic, Hebrew, Indic. | 中 あ א ا |
| Mn | M · Mark | Mark, nonspacing | Combining marks that occupy no horizontal advance (most diacritics). | ◌̀ ◌́ ◌̃ ◌̈ |
| Mc | M · Mark | Mark, spacing combining | Combining marks that do consume horizontal width — chiefly Indic vowel signs. | ा ि ी |
| Me | M · Mark | Mark, enclosing | Combining marks that enclose the preceding character. | ◌⃝ ◌⃞ |
| Nd | N · Number | Number, decimal digit | Codepoints in the 0–9 cycle of any decimal-digit script. There are 700+ of these. | 0 1 2 ٠ ١ २ |
| Nl | N · Number | Number, letter | Numerals that are also letters — Roman numerals, Greek acrophonic, Counting Rod. | Ⅰ Ⅱ Ⅲ Ⅳ |
| No | N · Number | Number, other | Fractions, superscript and subscript digits, circled and parenthesised digits. | ½ ¾ ① ② ³ |
| Pc | P · Punctuation | Punctuation, connector | Connecting punctuation, like the underscore, used to join words inside identifiers. | _ ‿ ⁀ |
| Pd | P · Punctuation | Punctuation, dash | The full dash family — hyphen, en dash, em dash, horizontal bar and minus-like dashes. | - – — ⸺ |
| Ps | P · Punctuation | Punctuation, open | Opening brackets, parentheses and CJK corner brackets. | ( [ { 「 |
| Pe | P · Punctuation | Punctuation, close | Closing brackets — the mirror partners of the Ps group. | ) ] } 」 |
| Pi | P · Punctuation | Punctuation, initial quote | Opening quotation marks, including French and German guillemets and curly quotes. | “ ‘ ‹ « |
| Pf | P · Punctuation | Punctuation, final quote | Closing quotation marks. Note that ASCII " and ' are Po, not Pf. | ” ’ › » |
| Po | P · Punctuation | Punctuation, other | Everything else: periods, commas, semicolons, the ampersand, the ASCII straight quotes. | . , ; : ! ? @ & |
| Sm | S · Symbol | Symbol, math | Mathematical operators, relational symbols, set-theory glyphs. | + < = ± ∞ ∑ |
| Sc | S · Symbol | Symbol, currency | All 60+ currency signs, from dollar to Indian rupee to the new bitcoin. | $ € £ ¥ ₹ |
| Sk | S · Symbol | Symbol, modifier | Spacing letter-modifier symbols — the standalone forms of accent marks. | ` ^ ¨ ¯ ´ |
| So | S · Symbol | Symbol, other | Everything pictographic — copyright, trademark, dingbats, most emoji. | © ® ™ ☃ ♥ 🌍 |
| Zs | Z · Separator | Separator, space | Horizontal whitespace — ASCII space, NBSP, en quad, em quad, hair space. | U+0020 U+00A0 U+2000 |
| Zl | Z · Separator | Separator, line | A single codepoint that marks a line separator. | U+2028 |
| Zp | Z · Separator | Separator, paragraph | A single codepoint that marks a paragraph separator. | U+2029 |
| Cc | C · Other | Other, control | The 65 ASCII and C1 control codes inherited from ISO/IEC 6429. | U+0000–001F U+007F–009F |
| Cf | C · Other | Other, format | Invisible formatting codepoints — ZWJ, ZWNJ, BOM, bidi controls, soft hyphen. | ZWJ ZWNJ BOM U+200B |
| Cs | C · Other | Other, surrogate | The 2,048 surrogate codepoints. Used by UTF-16; never assigned to characters. | U+D800–U+DFFF |
| Co | C · Other | Other, private use | The three Private Use Areas, totalling 137,468 codepoints reserved for unofficial use. | E000–F8FF PUA-A PUA-B |
| Cn | C · Other | Other, unassigned | Every codepoint not yet assigned to a character. Includes all 66 noncharacters. | — (~819,000 codepoints) |