[an error occurred while processing this directive] [an error occurred while processing this directive]
Mercifully short character code overview. An apparently simple subject which turns out to be brutally complicated - well to our modest brains.
This stuff (and more) is contained in Chapter 24 of the current HTML 4.01 spec (the last ever version of HTML - its all XHTML now) which you can get (and lots of other great stuff) from the W3C site including their fantastic page validation services.
If you find it useful - print it, stick it on your wall - and save our bandwidth!!
The reason you can never find this stuff is because the most popular values confusingly belong to a number of character sets.
There are two encoding forms supported by most browsers
Both formats are shown in the tables below. If you are using anything obscure or have to deal with old browsers (< v3 ish) you should stick to the numeric format only.
For convenience we show some popular values (from a number of character sets) in a single table then the full Latin-1 (ISO8859-1) table. If you are into the greek alphabet for obscure mathematical symbols - you are out of luck. Go the the W3C click on HTML then go for Chapter 24 of the latest spec - they are all there.
This lot should all display on most browsers. However in the tables below the column DN shows the result of using the name format encoding in your browser and the column D# shows the result using the numeric format. If there is no character in one or other of these column (except the space, dummy!) then your browser does not handle that format correctly.
Commonly used character entity references | |||||
DN | Name format | D# | Numeric format | Description | Char set |
|   | non-breaking space | ISO8559-1 | ||
© | © | © | © | copyright sign | ISO8559-1 |
® | ® | ® | ® | registered trade mark sign | ISO8559-1 |
° | ° | ° | ° | degree sign | ISO8559-1 |
² | ² | ² | ² | superscript 2 (squared) | ISO8559-1 |
³ | ³ | ³ | ³ | superscript 3 (cubed) | ISO8559-1 |
" | " | " | " | quotation mark | ISO10646 |
& | & | & | & | ampersand sign | ISO10646 |
< | < | < | < | less than sign | ISO10646 |
> | > | > | > | greater than sign | ISO10646 |
– | – | – | – | en dash | ISO10646 |
— | — | — | — | em dash | ISO10646 |
‘ | ‘ | ‘ | ‘ | left single quote | ISO10646 |
’ | ’ | ’ | ’ | right single quote, apostrophe | ISO10646 |
“ | “ | “ | “ | left double quotation mark | ISO10646 |
” | ” | ” | ” | right double quotation mark | ISO10646 |
• | • | • | • | small black circle, bullet | ISO10646 |
† | † | † | † | dagger sign | ISO10646 |
‡ | ‡ | ‡ | ‡ | double dagger sign | ISO10646 |
′ | ′ | ′ | ′ | prime = minutes = feet | ISO10646 |
″ | ″ | ″ | ″ | double prime = seconds = inches | ISO10646 |
‹ | ‹ | ‹ | ‹ | single left pointing angle quote | ISO10646 |
› | › | › | › | single right pointing angle quote | ISO10646 |
€ | € | € | € | euro sign | ISO10646 |
™ | ™ | ™ | ™ | Registered Trademark sign | ISO10646 |
⊕ | ⊕ | ⊕ | ⊕ | circled plus = direct sum | ISO10646 |
⊗ | ⊗ | ⊗ | ⊗ | circled times = vector product | ISO10646 |
˜ | ˜ | ˜ | ˜ | tilde sign | ISO10646 |
ˆ | ˆ | ˆ | ˆ | circumflex (or caret) sign | ISO10646 |
★ | ★ | black star | ISO10646 | ||
☆ | ☆ | empty star | ISO10646 | ||
♠ | ♠ | ♠ | ♠ | black spade suit | ISO10646 |
♣ | ♣ | ♣ | ♣ | black clubs suit | ISO10646 |
♥ | ♥ | ♥ | ♥ | black heart suit | ISO10646 |
♦ | ♦ | ♦ | ♦ | black diamonds suit | ISO10646 |
◊ | ◊ | ◊ | ◊ | lozenge | ISO10646 |
← | ← | ← | ← | left arrow | ISO10646 |
→ | → | → | → | right arrow | ISO10646 |
↑ | ↑ | ↑ | ↑ | up arrow | ISO10646 |
↓ | ↓ | ↓ | ↓ | down arrow | ISO10646 |
↔ | ↔ | ↔ | ↔ | right-left arrow | ISO10646 |
¬ | ¬ | ¬ | ¬ | NOT sign | ISO8859-1 |
Here is the full list of character entities for accented characters and miscellaneous symbols in the Latin-1 (ISO 8859-1) character set. Values in the range 00 to 7F (the ASCII/IA5 set in this table) are typically used as raw characters with the exceptions of the special characters used for HTML encoding as shown above (<, >, & and ") but they can be represented as HTML entities by using the ASCII/IA5 decimal number, for example ',' (comma) has a decimal vale of 44 (in the ASCII/IA5 table and may be represented as a HTML entity by encoding as ,.
Latin-1 (ISO8859-1) character entity references | ||||
DN | Name format | D# | Numeric format | Description |
| | | | | | vertical bar | |
|   | non-breaking space | ||
¡ | ¡ | ¡ | ¡ | inverted exclamation mark |
¢ | ¢ | ¢ | ¢ | cent sign |
£ | £ | £ | £ | pound sign |
¤ | ¤ | ¤ | ¤ | currency sign |
¥ | ¥ | ¥ | ¥ | yen sign = yuan sign |
¦ | ¦ | ¦ | ¦ | broken vertical bar |
§ | § | § | § | section sign |
¨ | ¨ | ¨ | ¨ | diaeresis = spacing diaeresis |
© | © | © | © | copyright sign |
ª | ª | ª | ª | feminine ordinal indicator |
« | « | « | « | left-pointing double angle quotes (left pointing quillemet) |
¬ | ¬ | ¬ | ¬ | not sign |
| ­ | | ­ | soft hyphen |
® | ® | ® | ® | registered sign |
¯ | ¯ | ¯ | ¯ | macron = spacing macron |
° | ° | ° | ° | degree sign |
± | ± | ± | ± | plus-minus sign |
² | ² | ² | ² | superscript two (squared) |
³ | ³ | ³ | ³ | superscript three (cubed) |
´ | ´ | ´ | ´ | acute accent |
µ | µ | µ | µ | micro sign |
¶ | ¶ | ¶ | ¶ | paragraph sign = pilcrow sign |
· | · | · | · | middle dot = georgian comma |
¸ | ¸ | ¸ | ¸ | cedilla sign |
¹ | ¹ | ¹ | ¹ | superscript one |
º | º | º | º | masculine ordinal indicator |
» | » | » | » | right-pointing double angle quotes (right pointing quillemet) |
¼ | ¼ | ¼ | ¼ | vulgar fraction one quarter |
½ | ½ | ½ | ½ | vulgar fraction one half |
¾ | ¾ | ¾ | ¾ | vulgar fraction three quarters |
¿ | ¿ | ¿ | ¿ | inverted question mark |
À | À | À | À | latin capital A with grave accent |
Á | Á | Á | Á | latin capital A with acute accent |
 |  |  |  | latin capital A with circumflex |
à | à | à | à | latin capital A with tilde |
Ä | Ä | Ä | Ä | latin capital A with diaeresis |
Å | Å | Å | Å | latin capital A with ring |
Æ | Æ | Æ | Æ | latin capital AE |
Ç | Ç | Ç | Ç | latin capital C with cedilla |
È | È | È | È | latin capital E with grave accent |
É | É | É | É | latin capital E with acute accent |
Ê | Ê | Ê | Ê | latin capital E with circumflex |
Ë | Ë | Ë | Ë | latin capital E with diaeresis |
Ì | Ì | Ì | Ì | latin capital I with grave accent |
Í | Í | Í | Í | latin capital I with acute accent |
Î | Î | Î | Î | latin capital I with circumflex |
Ï | Ï | Ï | Ï | latin capital I with diaeresis |
Ð | Ð | Ð | Ð | latin capital letter ETH |
Ñ | Ñ | Ñ | Ñ | latin capital N with tilde |
Ò | Ò | Ò | Ò | latin capital O with grave accent |
Ó | Ó | Ó | Ó | latin capital O with acute accent |
Ô | Ô | Ô | Ô | latin capital O with circumflex |
Õ | Õ | Õ | Õ | latin capital O with tilde |
Ö | Ö | Ö | Ö | latin capital O with diaeresis |
× | × | × | × | multiplication sign |
Ø | Ø | Ø | Ø | latin capital O with stroke |
Ù | Ù | Ù | Ù | latin capital U with grave accent |
Ú | Ú | Ú | Ú | latin capital U with acute accent |
Û | Û | Û | Û | latin capital U with circumflex |
Ü | &Uml; | Ü | Ü | latin capital U with diaeresis |
Ý | Ý | Ý | Ý | latin capital Y with acute accent |
Þ | Þ | Þ | Þ | latin capital THORN |
ß | ß | ß | ß | latin small letter sharp s |
à | à | à | à | latin small letter a with grave accent |
á | á | á | á | latin small letter a with acute accent |
â | â | â | â | latin small letter a with circumflex |
ã | ã | ã | ã | latin small letter a with tilde |
ä | ä | ä | ä | latin small letter a with diaeresis |
å | å | å | å | latin small letter a with ring |
æ | æ | æ | æ | latin small letter ae |
ç | ç | ç | ç | latin small letter c with cedilla |
è | è | è | è | latin small letter e with grave accent |
é | é | é | é | latin small letter e with acute accent |
ê | ê | ê | ê | latin small letter e with circumflex |
ë | ë | ë | ë | latin small letter e with diaeresis |
ì | ì | ì | ì | latin small letter i with grave accent |
í | í | í | í | latin small letter i with acute accent |
î | î | î | î | latin small letter i with circumflex |
ï | ï | ï | ï | latin small letter i with diaeresis |
ð | ð | ð | ð | latin small letter eth |
ñ | ñ | ñ | ñ | latin small letter n with tilde |
ò | ò | ò | ò | latin small letter 0 with grave accent |
ó | ó | ó | ó | latin small letter 0 with acute accent |
ô | ô | ô | ô | latin small letter 0 with circumflex |
õ | õ | õ | õ | latin small letter 0 with tilde |
ö | ö | ö | ö | latin small letter 0 with diaeresis |
÷ | ÷ | ÷ | ÷ | division sign |
ø | ø | ø | ø | latin small letter 0 with stroke |
ù | ù | ù | ù | latin small letter u with grave accent |
ú | ú | ú | ú | latin small letter u with acute accent |
û | û | û | û | latin small letter u with circumflex |
ü | ü | ü | ü | latin small letter u with diareresis |
ý | ý | ý | ý | latin small letter y with acute accent |
þ | þ | þ | þ | latin small letter thorn |
ÿ | ÿ | ÿ | ÿ | latin small letter y with diaeresis |