BLOCK · U+0080 – U+00FF · 128 CHARS

Latin-1 Supplement

The second 128 codepoints of Unicode — accented Latin letters, common symbols, and an awkward block of mostly-unused C1 control codes.

The Latin-1 Supplement block carries the second hundred-and-twenty-eight codepoints of Unicode, U+0080 through U+00FF. Like Basic Latin before it, every codepoint in this block is inherited directly from an earlier standard: ISO 8859-1, commonly called "Latin-1", which extended seven-bit ASCII to eight bits by adding the accented letters needed for Western European languages — French, Spanish, Portuguese, Italian, German, Dutch, Icelandic, Nordic — plus a small set of symbols and punctuation marks that had been missing from the original ASCII set.

About this block

ISO 8859-1 was published in 1987 by the International Organization for Standardization, with the printable characters running from byte 0xA0 through 0xFF and the range 0x80–0x9F reserved for so-called C1 control codes. Unicode preserved this layout exactly when it adopted Latin-1, which is why the C1 controls — U+0080 through U+009F — still occupy the first half of this block. They were defined by ISO 6429 / ECMA-48 for terminal control alongside the C0 codes in Basic Latin, but unlike LF and ESC, almost none of them ever saw real use. Most Unicode-aware software treats them as opaque format characters and renders nothing.

Printable Latin-1 begins at U+00A0 NO-BREAK SPACE — the non-breaking space familiar to anyone who has written   in HTML. Past that comes the punctuation row (¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯), the math and superscript row (° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿), and then the accented-letter rows: uppercase from U+00C0 (À) through U+00DE (Þ), the multiplication sign U+00D7 as an unfortunate stowaway in the middle, then lowercase from U+00DF (ß) through U+00FF (ÿ), with the division sign U+00F7 sitting in the parallel position. This grid layout was deliberate: each uppercase letter sits exactly 32 codepoints before its lowercase counterpart, mirroring ASCII.

The most consequential symbols here are the typographic and legal marks used everywhere on Western web pages. © (U+00A9), ® (U+00AE), ° (U+00B0), ± (U+00B1), £ (U+00A3), and the vulgar fractions ¼ ½ ¾ all live in this block. The accented vowels — é, è, ñ, ü, å, ø — power every European text. The German ß "sharp s" sits at U+00DF; its modern uppercase counterpart ẞ lives much later, at U+1E9E.

One persistent source of bugs: Windows-1252 is not Latin-1. Microsoft's "ANSI" code page reuses the same upper range but fills the C1 area (0x80–0x9F) with printable characters — the Euro sign €, smart quotes ‘ ’ “ ”, the em dash —, the bullet •, the ellipsis …, and the trademark ™. When a Windows-1252 byte stream is decoded as Latin-1 (or, worse, mis-tagged in HTTP headers), those characters become control codes, vanish, or render as mojibake. Modern code should declare UTF-8 and stop interpreting bytes as Latin-1 entirely — but the legacy is still very much alive in old databases, mail archives, and CSV exports.

Notable characters

U+00A0 No-Break Space U+00A3£Pound Sign U+00A5¥Yen Sign U+00A7§Section Sign U+00A9©Copyright Sign U+00AE®Registered Sign U+00B0°Degree Sign U+00B1±Plus-Minus Sign U+00BC¼Vulgar Fraction One Quarter U+00BD½Vulgar Fraction One Half U+00C6ÆLatin AE U+00D7×Multiplication Sign U+00DFßSharp S U+00E9éLatin e Acute U+00F1ñLatin n Tilde U+00F7÷Division Sign

Every character in the block

U+0080·C1 PAD U+0081·C1 HOP U+0082·C1 BPH U+0083·C1 NBH U+0084·C1 IND U+0085·NEL Next Line U+0086·C1 SSA U+0087·C1 ESA U+0088·C1 HTS U+0089·C1 HTJ U+008A·C1 VTS U+008B·C1 PLD U+008C·C1 PLU U+008D·C1 RI U+008E·C1 SS2 U+008F·C1 SS3 U+0090·C1 DCS U+0091·C1 PU1 U+0092·C1 PU2 U+0093·C1 STS U+0094·C1 CCH U+0095·C1 MW U+0096·C1 SPA U+0097·C1 EPA U+0098·C1 SOS U+0099·C1 SGCI U+009A·C1 SCI U+009B·C1 CSI U+009C·C1 ST U+009D·C1 OSC U+009E·C1 PM U+009F·C1 APC U+00A0 No-Break Space U+00A1¡Inverted Exclamation U+00A2¢Cent Sign U+00A3£Pound Sign U+00A4¤Currency Sign U+00A5¥Yen Sign U+00A6¦Broken Bar U+00A7§Section Sign U+00A8¨Diaeresis U+00A9©Copyright Sign U+00AAªFeminine Ordinal U+00AB«Left Guillemet U+00AC¬Not Sign U+00ADSoft Hyphen U+00AE®Registered Sign U+00AF¯Macron U+00B0°Degree Sign U+00B1±Plus-Minus Sign U+00B2²Superscript Two U+00B3³Superscript Three U+00B4´Acute Accent U+00B5µMicro Sign U+00B6¶Pilcrow Sign U+00B7·Middle Dot U+00B8¸Cedilla U+00B9¹Superscript One U+00BAºMasculine Ordinal U+00BB»Right Guillemet U+00BC¼One Quarter U+00BD½One Half U+00BE¾Three Quarters U+00BF¿Inverted Question U+00C0ÀA Grave U+00C1ÁA Acute U+00C2ÂA Circumflex U+00C3ÃA Tilde U+00C4ÄA Diaeresis U+00C5ÅA Ring U+00C6ÆAE U+00C7ÇC Cedilla U+00C8ÈE Grave U+00C9ÉE Acute U+00CAÊE Circumflex U+00CBËE Diaeresis U+00CCÌI Grave U+00CDÍI Acute U+00CEÎI Circumflex U+00CFÏI Diaeresis U+00D0ÐEth U+00D1ÑN Tilde U+00D2ÒO Grave U+00D3ÓO Acute U+00D4ÔO Circumflex U+00D5ÕO Tilde U+00D6ÖO Diaeresis U+00D7×Multiplication U+00D8ØO Stroke U+00D9ÙU Grave U+00DAÚU Acute U+00DBÛU Circumflex U+00DCÜU Diaeresis U+00DDÝY Acute U+00DEÞThorn U+00DFßSharp S U+00E0àa Grave U+00E1áa Acute U+00E2âa Circumflex U+00E3ãa Tilde U+00E4äa Diaeresis U+00E5åa Ring U+00E6æae U+00E7çc Cedilla U+00E8èe Grave U+00E9ée Acute U+00EAêe Circumflex U+00EBëe Diaeresis U+00ECìi Grave U+00EDíi Acute U+00EEîi Circumflex U+00EFïi Diaeresis U+00F0ðeth U+00F1ñn Tilde U+00F2òo Grave U+00F3óo Acute U+00F4ôo Circumflex U+00F5õo Tilde U+00F6öo Diaeresis U+00F7÷Division Sign U+00F8øo Stroke U+00F9ùu Grave U+00FAúu Acute U+00FBûu Circumflex U+00FCüu Diaeresis U+00FDýy Acute U+00FEþthorn U+00FFÿy Diaeresis

About this block

Notable characters

Every character in the block

Related blocks