Every character, fully documented.
Look up any of the 154,998 characters in Unicode (as of version 16.0) by name, by codepoint, or by pasting the glyph itself.
Unicode is the agreement that lets a Greek letter stay the same character in a French textbook, a Japanese website, and a Brazilian government form. Every character (every letter, digit, punctuation mark, symbol, and emoji you can type or read on a screen) has a number assigned to it. That number is its codepoint.
This site is a working reference for those characters. Each page documents one character, one block, or one technique, with the kind of detail you actually need: the codepoint in decimal and hex, the official name, the script and category, the HTML and CSS escapes, the UTF-8 and UTF-16 byte sequences, related characters, and notes on usage and history. No fluff. No pop-ups. Just the reference.
01 Codepoint converter
Convert between decimal, hexadecimal, U+XXXX notation, and the character itself.
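As a rough illustration of what such a conversion involves, here is a minimal Python sketch. The names parse_codepoint and describe are hypothetical, chosen for this example and not taken from the tool itself, and the sketch assumes that a single ASCII digit is meant as a decimal number, not as the digit character.

```python
MAX_CODEPOINT = 0x10FFFF  # highest codepoint Unicode defines


def parse_codepoint(text: str) -> int:
    """Turn 'U+03A9', '0x3A9', '937', or one pasted character into an int."""
    # A lone character is treated as a pasted glyph, unless it is an ASCII
    # digit, which this sketch assumes is meant as a decimal number.
    if len(text) == 1 and not ("0" <= text <= "9"):
        return ord(text)
    s = text.strip()
    if s[:2].upper() in ("U+", "0X"):
        value = int(s[2:], 16)   # hexadecimal after the prefix
    else:
        value = int(s, 10)       # plain decimal
    if not 0 <= value <= MAX_CODEPOINT:
        raise ValueError(f"outside the Unicode range: {text!r}")
    return value


def describe(cp: int) -> dict:
    """Show one codepoint in each of the four notations."""
    return {
        "decimal": cp,
        "hex": f"0x{cp:X}",
        "notation": f"U+{cp:04X}",
        "character": chr(cp),
    }
```

For example, describe(parse_codepoint("U+03A9")) reports decimal 937, hex 0x3A9, notation U+03A9, and the character Ω.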
02 UTF-8 encoder
See the byte sequence for any text in UTF-8, UTF-16, and UTF-32.
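The same comparison can be reproduced with Python's built-in codecs. This is only a sketch: byte_sequences is a hypothetical helper name, and big-endian byte order without a byte order mark is an assumption made here to keep the output easy to read.

```python
def byte_sequences(text: str) -> dict:
    """Return the bytes of `text` in three encodings as spaced uppercase hex."""
    encodings = {
        "UTF-8": "utf-8",
        "UTF-16": "utf-16-be",   # big-endian, no BOM (assumption of this sketch)
        "UTF-32": "utf-32-be",   # big-endian, no BOM (assumption of this sketch)
    }
    return {
        label: text.encode(codec).hex(" ").upper()
        for label, codec in encodings.items()
    }
```

For the euro sign U+20AC this gives E2 82 AC in UTF-8, 20 AC in UTF-16, and 00 00 20 AC in UTF-32. A character outside the Basic Multilingual Plane, such as U+1F600, takes four bytes in all three, with UTF-16 using the surrogate pair D8 3D DE 00.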
03 HTML entity encoder
Convert text to named, decimal, and hexadecimal HTML entities.
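A minimal sketch of the three forms, using the entity table in Python's standard library. The function name to_references is hypothetical, and the codepoint2name table covers only the older HTML 4 named entities, so many characters will have no named form in this sketch.

```python
from html.entities import codepoint2name


def to_references(ch: str) -> dict:
    """Return the named, decimal, and hexadecimal reference for one character."""
    cp = ord(ch)
    name = codepoint2name.get(cp)  # None when no HTML 4 name exists
    return {
        "named": f"&{name};" if name else None,
        "decimal": f"&#{cp};",
        "hexadecimal": f"&#x{cp:X};",
    }
```

For the euro sign this yields &euro;, &#8364;, and &#x20AC;. All three decode back to the same character with html.unescape.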
04 Character inspector
Paste any string and see every codepoint, name, category, and byte length.
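What such an inspection returns can be approximated with Python's unicodedata module. This is a sketch, not the site's implementation: inspect_text is a hypothetical name, byte length is assumed to mean UTF-8 bytes, and the names and categories come from whichever Unicode version your Python build ships with.

```python
import unicodedata


def inspect_text(text: str) -> list:
    """List codepoint, name, general category, and UTF-8 length per character."""
    rows = []
    for ch in text:
        rows.append({
            "codepoint": f"U+{ord(ch):04X}",
            # Control characters and unassigned codepoints have no name.
            "name": unicodedata.name(ch, "<no name>"),
            "category": unicodedata.category(ch),
            # surrogatepass keeps the call from failing on lone surrogates.
            "utf8_bytes": len(ch.encode("utf-8", errors="surrogatepass")),
        })
    return rows
```

Note that the loop walks codepoints, not user-perceived characters, so a letter followed by a combining accent appears as two rows.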
05 URL encoder
Percent-encode and decode strings for safe use in URLs and queries.
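Percent-encoding works on the UTF-8 bytes of the text, which a short Python sketch makes visible. The wrapper names encode_component and decode_component are hypothetical; passing safe="" is a choice made here so that reserved characters such as / and & are escaped too, as is usual for a single query value.

```python
from urllib.parse import quote, unquote


def encode_component(text: str) -> str:
    """Percent-encode everything except unreserved characters."""
    return quote(text, safe="")


def decode_component(text: str) -> str:
    """Reverse percent-encoding, reading the bytes as UTF-8."""
    return unquote(text)
```

For example, encode_component("café & crème") returns caf%C3%A9%20%26%20cr%C3%A8me, where %C3%A9 is the two-byte UTF-8 sequence for é.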
06 Unicode normalizer
Apply NFC, NFD, NFKC, or NFKD normalization to any text.
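To see how the four forms differ, here is a small sketch built on Python's unicodedata.normalize. The helper name normalize_all is hypothetical.

```python
import unicodedata

FORMS = ("NFC", "NFD", "NFKC", "NFKD")


def normalize_all(text: str) -> dict:
    """Apply each normalization form to `text` and return the results."""
    return {form: unicodedata.normalize(form, text) for form in FORMS}


def codepoints(text: str) -> str:
    """Render a string as a space-separated list of U+XXXX codepoints."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)
```

Applied to the ligature U+FB01 followed by U+00E9 (é), NFC leaves the text unchanged, NFD splits é into U+0065 U+0301, and the two compatibility forms additionally replace the ligature with the plain letters f and i.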
What is Unicode?
A plain-English introduction to the character standard that powers the modern internet — codepoints, planes, encodings, and why it matters.
Encoding: UTF-8, UTF-16, UTF-32 compared
How the three Unicode Transformation Formats actually differ, when each is used, and why UTF-8 won the web.
Concepts: Codepoint, character, glyph, grapheme
Four words for things that look similar but aren't. Get them right and Unicode stops being mysterious.
Reference: Unicode normalization explained
NFC, NFD, NFKC, NFKD — what they do, when to use them, and the bugs they fix.
Emoji: How emoji work
Skin tones, ZWJ sequences, variation selectors: the surprising mechanics inside a single 👨‍👩‍👧‍👦.
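A family emoji of this kind is a sequence of four person emoji joined by the invisible character U+200D ZERO WIDTH JOINER. The sketch below takes one apart in Python; the constant FAMILY and the helper names are hypothetical, and the sequence is written with escapes so the joiners cannot be lost in copying.

```python
import unicodedata

ZWJ = "\u200d"  # ZERO WIDTH JOINER

# Man + ZWJ + woman + ZWJ + girl + ZWJ + boy
FAMILY = "\U0001F468\u200D\U0001F469\u200D\U0001F467\u200D\U0001F466"


def zwj_parts(sequence: str) -> list:
    """Split a ZWJ sequence into the emoji it is built from."""
    return sequence.split(ZWJ)


def codepoint_names(sequence: str) -> list:
    """Name every codepoint in the sequence, joiners included."""
    return [unicodedata.name(ch) for ch in sequence]
```

Here len(FAMILY) is 7 codepoints, even though a font that supports the sequence draws a single glyph. Remove the joiners and the same four emoji render side by side.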
Web: HTML entities & escapes
Named, decimal, hexadecimal. Which you need, which to avoid, and how character references work in browsers.
Layout: Bidirectional text & RTL
Why Arabic and Hebrew break layouts, and the algorithm that fixes it.
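The algorithm in question, the Unicode Bidirectional Algorithm, starts from a bidirectional class assigned to every character. A short Python sketch can show those classes for a mixed string; bidi_classes is a hypothetical helper name, and this covers only the first input to the algorithm, not the reordering itself.

```python
import unicodedata


def bidi_classes(text: str) -> list:
    """Pair each codepoint with its bidirectional class."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.bidirectional(ch))
        for ch in text
    ]
```

A Latin letter reports L (left-to-right), a Hebrew letter R, an Arabic letter AL, an ASCII digit EN (European number), and a space WS. Digits and spaces take their direction from their neighbours, which is where most mixed-direction layout surprises come from.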
Curiosities: The Private Use Areas
137,468 codepoints reserved for anyone to use. Tolkien's Elvish lives here.
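The figure is the sum of three ranges: one in the Basic Multilingual Plane and two that fill planes 15 and 16, minus the last two codepoints of each of those planes. A minimal Python sketch of the arithmetic and a membership check follows; the names PRIVATE_USE_RANGES and is_private_use are hypothetical.

```python
# Inclusive (first, last) codepoint of each Private Use Area.
PRIVATE_USE_RANGES = [
    (0xE000, 0xF8FF),        # Private Use Area (BMP): 6,400 codepoints
    (0xF0000, 0xFFFFD),      # Supplementary Private Use Area-A: 65,534
    (0x100000, 0x10FFFD),    # Supplementary Private Use Area-B: 65,534
]


def private_use_total() -> int:
    """Count every codepoint covered by the three ranges."""
    return sum(last - first + 1 for first, last in PRIVATE_USE_RANGES)


def is_private_use(cp: int) -> bool:
    """True when the codepoint falls inside any Private Use Area."""
    return any(first <= cp <= last for first, last in PRIVATE_USE_RANGES)
```

private_use_total() returns 137468, matching the count above.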