Unicode, as defined by the Unicode Consortium, has become a universal standard and is kept synchronized with ISO/IEC 10646, which describes the 'Universal Multiple-Octet Coded Character Set' (UCS). See: http://www.unicode.org
In MIME-encoded content, the "Content-Type:" header is often followed by a "charset=" parameter that names the character encoding of the body. Two common values are:
charset=UTF-8. This covers all the Unicode characters and encodes each one as a sequence of one to four bytes, which makes it possible to transfer Unicode text to another computer reliably. UTF-8 stands for UCS Transformation Format 8. See:
charset=windows-1252. The standard Windows Roman encoding is 'code page 1252'. See: ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT
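As a minimal sketch of how the charset parameter is used, the Python below reads it from a Content-Type value and decodes body bytes accordingly. The helper names `charset_of` and `decode_body` are made up for this illustration, and the "us-ascii" fallback is an assumption of the sketch, not something this page specifies.

```python
from email.message import Message


def charset_of(content_type: str, default: str = "us-ascii") -> str:
    """Return the lowercased charset parameter of a Content-Type value."""
    msg = Message()
    msg["Content-Type"] = content_type
    # Falls back to `default` (an assumption of this sketch) when no
    # charset parameter is present.
    return msg.get_content_charset(default)


def decode_body(raw: bytes, content_type: str) -> str:
    """Decode raw body bytes using the charset named in the header."""
    return raw.decode(charset_of(content_type))


# The same character is stored as different bytes under each charset.
utf8_bytes = "é".encode("utf-8")            # two bytes: C3 A9
cp1252_bytes = "é".encode("windows-1252")   # one byte: E9

print(charset_of("text/html; charset=UTF-8"))
print(decode_body(utf8_bytes, "text/html; charset=UTF-8"))
print(decode_body(cp1252_bytes, "text/html; charset=windows-1252"))
```

Decoding with the wrong charset is what produces the familiar garbled accents, so the header value and the bytes have to agree.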
ASCII is the first 128 Unicode characters (codes 0 through 127)
ISO Latin-1 (ISO 8859-1) is the first 256 Unicode characters (codes 0 through 255)
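The relationship between these encodings and Unicode can be checked directly. The following is a small sketch using Python's built-in codecs; the variable names are illustrative only. It also shows one place where windows-1252 departs from Latin-1, which this page does not cover in detail.

```python
# Every ASCII byte value decodes to the Unicode code point of the same number.
ascii_text = bytes(range(128)).decode("ascii")
ascii_matches = all(ord(ch) == i for i, ch in enumerate(ascii_text))

# The same holds for all 256 byte values under ISO Latin-1.
latin1_text = bytes(range(256)).decode("latin-1")
latin1_matches = all(ord(ch) == i for i, ch in enumerate(latin1_text))

# windows-1252 is not identical to Latin-1: for example, byte 0x80 is the
# euro sign (U+20AC) there, while Latin-1 maps it to code point 128.
cp1252_euro = b"\x80".decode("windows-1252")
latin1_0x80 = b"\x80".decode("latin-1")

print(ascii_matches, latin1_matches)
print(hex(ord(cp1252_euro)), hex(ord(latin1_0x80)))
```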
The following table contains the complete ISO Latin-1 character set,
corresponding to the first 256 entries of the Unicode character repertoire
in Microsoft® Internet Explorer 4.0 and later. The table provides each
character, its decimal code, its named entity reference for HTML, and also
a brief description. See also:
http://msdn.microsoft.com/workshop/author/dhtml/reference/charsets/charsets.asp
Character | Decimal code | Named entity | Description |
---|---|---|---|
--- | &#0; | --- | Unused |
--- | &#1; | --- | Unused |
--- | &#2; | --- | Unused |
--- | &#3; | --- | Unused |
--- | &#4; | --- | Unused |
--- | &#5; | --- | Unused |
--- | &#6; | --- | Unused |
--- | &#7; | --- | Unused |
--- | &#8; | --- | Unused |
--- | &#9; | --- | Horizontal tab |
--- | &#10; | --- | Line feed |
--- | &#11; | --- | Unused |
--- | &#12; | --- | Unused |
--- | &#13; | --- | Carriage Return |
--- | &#14; | --- | Unused |
--- | &#15; | --- | Unused |
--- | &#16; | --- | Unused |
--- | &#17; | --- | Unused |
--- | &#18; | --- | Unused |
--- | &#19; | --- | Unused |
--- | &#20; | --- | Unused |
--- | &#21; | --- | Unused |
--- | &#22; | --- | Unused |
--- | &#23; | --- | Unused |
--- | &#24; | --- | Unused |
--- | &#25; | --- | Unused |
--- | &#26; | --- | Unused |
--- | &#27; | --- | Unused |
--- | &#28; | --- | Unused |
--- | &#29; | --- | Unused |
--- | &#30; | --- | Unused |
--- | &#31; | --- | Unused |
  | &#32; | --- | Space |
! | &#33; | --- | Exclamation mark |
" | &#34; | &quot; | Quotation mark |
# | &#35; | --- | Number sign |
$ | &#36; | --- | Dollar sign |
% | &#37; | --- | Percent sign |
& | &#38; | &amp; | Ampersand |
' | &#39; | --- | Apostrophe |
( | &#40; | --- | Left parenthesis |
) | &#41; | --- | Right parenthesis |
* | &#42; | --- | Asterisk |
+ | &#43; | --- | Plus sign |
, | &#44; | --- | Comma |
- | &#45; | --- | Hyphen |
. | &#46; | --- | Period (fullstop) |
/ | &#47; | --- | Solidus (slash) |
0 | &#48; | --- | Digit 0 |
1 | &#49; | --- | Digit 1 |
2 | &#50; | --- | Digit 2 |
3 | &#51; | --- | Digit 3 |
4 | &#52; | --- | Digit 4 |
5 | &#53; | --- | Digit 5 |
6 | &#54; | --- | Digit 6 |
7 | &#55; | --- | Digit 7 |
8 | &#56; | --- | Digit 8 |
9 | &#57; | --- | Digit 9 |
: | &#58; | --- | Colon |
; | &#59; | --- | Semicolon |
< | &#60; | &lt; | Less than |
= | &#61; | --- | Equals sign |
> | &#62; | &gt; | Greater than |
? | &#63; | --- | Question mark |
@ | &#64; | --- | Commercial at |
A | &#65; | --- | Capital A |
B | &#66; | --- | Capital B |
C | &#67; | --- | Capital C |
D | &#68; | --- | Capital D |
E | &#69; | --- | Capital E |
F | &#70; | --- | Capital F |
G | &#71; | --- | Capital G |
H | &#72; | --- | Capital H |
I | &#73; | --- | Capital I |
J | &#74; | --- | Capital J |
K | &#75; | --- | Capital K |
L | &#76; | --- | Capital L |
M | &#77; | --- | Capital M |
N | &#78; | --- | Capital N |
O | &#79; | --- | Capital O |
P | &#80; | --- | Capital P |
Q | &#81; | --- | Capital Q |
R | &#82; | --- | Capital R |
S | &#83; | --- | Capital S |
T | &#84; | --- | Capital T |
U | &#85; | --- | Capital U |
V | &#86; | --- | Capital V |
W | &#87; | --- | Capital W |
X | &#88; | --- | Capital X |
Y | &#89; | --- | Capital Y |
Z | &#90; | --- | Capital Z |
[ | &#91; | --- | Left square bracket |
\ | &#92; | --- | Reverse solidus (backslash) |
] | &#93; | --- | Right square bracket |
^ | &#94; | --- | Caret |
_ | &#95; | --- | Horizontal bar (underscore) |
` | &#96; | --- | Grave accent |
a | &#97; | --- | Small a |
b | &#98; | --- | Small b |
c | &#99; | --- | Small c |
d | &#100; | --- | Small d |
e | &#101; | --- | Small e |
f | &#102; | --- | Small f |
g | &#103; | --- | Small g |
h | &#104; | --- | Small h |
i | &#105; | --- | Small i |
j | &#106; | --- | Small j |
k | &#107; | --- | Small k |
l | &#108; | --- | Small l |
m | &#109; | --- | Small m |
n | &#110; | --- | Small n |
o | &#111; | --- | Small o |
p | &#112; | --- | Small p |
q | &#113; | --- | Small q |
r | &#114; | --- | Small r |
s | &#115; | --- | Small s |
t | &#116; | --- | Small t |
u | &#117; | --- | Small u |
v | &#118; | --- | Small v |
w | &#119; | --- | Small w |
x | &#120; | --- | Small x |
y | &#121; | --- | Small y |
z | &#122; | --- | Small z |
{ | &#123; | --- | Left curly brace |
\| | &#124; | --- | Vertical bar |
} | &#125; | --- | Right curly brace |
~ | &#126; | --- | Tilde |
--- | &#127; | --- | Unused |
  | &#160; | &nbsp; | Nonbreaking space; also use <nobr>...</nobr> around text. |
¡ | &#161; | &iexcl; | Inverted exclamation |
¢ | &#162; | &cent; | Cent sign |
£ | &#163; | &pound; | Pound sterling |
¤ | &#164; | &curren; | General currency sign |
¥ | &#165; | &yen; | Yen sign |
¦ | &#166; | &brvbar; or &brkbar; | Broken vertical bar |
§ | &#167; | &sect; | Section sign |
¨ | &#168; | &uml; or &die; | Diæresis / Umlaut |
© | &#169; | &copy; | Copyright |
ª | &#170; | &ordf; | Feminine ordinal |
« | &#171; | &laquo; | Left angle quote, guillemet left |
¬ | &#172; | &not; | Not sign |
­ | &#173; | &shy; | Soft hyphen |
® | &#174; | &reg; | Registered trademark |
¯ | &#175; | &macr; or &hibar; | Macron accent |
° | &#176; | &deg; | Degree sign |
± | &#177; | &plusmn; | Plus or minus |
² | &#178; | &sup2; | Superscript two |
³ | &#179; | &sup3; | Superscript three |
´ | &#180; | &acute; | Acute accent |
µ | &#181; | &micro; | Micro sign |
¶ | &#182; | &para; | Paragraph sign |
· | &#183; | &middot; | Middle dot |
¸ | &#184; | &cedil; | Cedilla |
¹ | &#185; | &sup1; | Superscript one |
º | &#186; | &ordm; | Masculine ordinal |
» | &#187; | &raquo; | Right angle quote, guillemet right |
¼ | &#188; | &frac14; | Fraction one-fourth |
½ | &#189; | &frac12; | Fraction one-half |
¾ | &#190; | &frac34; | Fraction three-fourths |
¿ | &#191; | &iquest; | Inverted question mark |
À | &#192; | &Agrave; | Capital A, grave accent |
Á | &#193; | &Aacute; | Capital A, acute accent |
 | &#194; | &Acirc; | Capital A, circumflex |
à | &#195; | &Atilde; | Capital A, tilde |
Ä | &#196; | &Auml; | Capital A, diæresis / umlaut |
Å | &#197; | &Aring; | Capital A, ring |
Æ | &#198; | &AElig; | Capital AE ligature |
Ç | &#199; | &Ccedil; | Capital C, cedilla |
È | &#200; | &Egrave; | Capital E, grave accent |
É | &#201; | &Eacute; | Capital E, acute accent |
Ê | &#202; | &Ecirc; | Capital E, circumflex |
Ë | &#203; | &Euml; | Capital E, diæresis / umlaut |
Ì | &#204; | &Igrave; | Capital I, grave accent |
Í | &#205; | &Iacute; | Capital I, acute accent |
Î | &#206; | &Icirc; | Capital I, circumflex |
Ï | &#207; | &Iuml; | Capital I, diæresis / umlaut |
Ð | &#208; | &ETH; | Capital Eth, Icelandic |
Ñ | &#209; | &Ntilde; | Capital N, tilde |
Ò | &#210; | &Ograve; | Capital O, grave accent |
Ó | &#211; | &Oacute; | Capital O, acute accent |
Ô | &#212; | &Ocirc; | Capital O, circumflex |
Õ | &#213; | &Otilde; | Capital O, tilde |
Ö | &#214; | &Ouml; | Capital O, diæresis / umlaut |
× | &#215; | &times; | Multiply sign |
Ø | &#216; | &Oslash; | Capital O, slash |
Ù | &#217; | &Ugrave; | Capital U, grave accent |
Ú | &#218; | &Uacute; | Capital U, acute accent |
Û | &#219; | &Ucirc; | Capital U, circumflex |
Ü | &#220; | &Uuml; | Capital U, diæresis / umlaut |
Ý | &#221; | &Yacute; | Capital Y, acute accent |
Þ | &#222; | &THORN; | Capital Thorn, Icelandic |
ß | &#223; | &szlig; | Small sharp s, German sz |
à | &#224; | &agrave; | Small a, grave accent |
á | &#225; | &aacute; | Small a, acute accent |
â | &#226; | &acirc; | Small a, circumflex |
ã | &#227; | &atilde; | Small a, tilde |
ä | &#228; | &auml; | Small a, diæresis / umlaut |
å | &#229; | &aring; | Small a, ring |
æ | &#230; | &aelig; | Small ae ligature |
ç | &#231; | &ccedil; | Small c, cedilla |
è | &#232; | &egrave; | Small e, grave accent |
é | &#233; | &eacute; | Small e, acute accent |
ê | &#234; | &ecirc; | Small e, circumflex |
ë | &#235; | &euml; | Small e, diæresis / umlaut |
ì | &#236; | &igrave; | Small i, grave accent |
í | &#237; | &iacute; | Small i, acute accent |
î | &#238; | &icirc; | Small i, circumflex |
ï | &#239; | &iuml; | Small i, diæresis / umlaut |
ð | &#240; | &eth; | Small eth, Icelandic |
ñ | &#241; | &ntilde; | Small n, tilde |
ò | &#242; | &ograve; | Small o, grave accent |
ó | &#243; | &oacute; | Small o, acute accent |
ô | &#244; | &ocirc; | Small o, circumflex |
õ | &#245; | &otilde; | Small o, tilde |
ö | &#246; | &ouml; | Small o, diæresis / umlaut |
÷ | &#247; | &divide; | Division sign |
ø | &#248; | &oslash; | Small o, slash |
ù | &#249; | &ugrave; | Small u, grave accent |
ú | &#250; | &uacute; | Small u, acute accent |
û | &#251; | &ucirc; | Small u, circumflex |
ü | &#252; | &uuml; | Small u, diæresis / umlaut |
ý | &#253; | &yacute; | Small y, acute accent |
þ | &#254; | &thorn; | Small thorn, Icelandic |
ÿ | &#255; | &yuml; | Small y, diæresis / umlaut |
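
The first three columns of a table like this can be regenerated programmatically. The sketch below uses Python's standard `html.entities.codepoint2name` mapping (the HTML 4 entity names) and `html.unescape`; the function `table_row` and its three-column layout are hypothetical, chosen only for this illustration, and the description column is left out because it has no standard library source.

```python
import html
from html.entities import codepoint2name


def table_row(code: int) -> str:
    """Build 'character | decimal reference | named entity' for one code."""
    char = chr(code)
    # Control and invisible characters are shown as '---' in the first column.
    shown = char if char.isprintable() else "---"
    name = codepoint2name.get(code)
    entity = f"&{name};" if name else "---"
    return f"{shown} | &#{code}; | {entity}"


# Both reference styles resolve to the same character.
same_char = html.unescape("&#233;") == html.unescape("&eacute;")

for code in (33, 38, 233):
    print(table_row(code))
print(same_char)
```

A numeric reference works for any code in the table, while a named entity exists only for the codes that HTML assigns a name to.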
file: /Techref/language/html/charset.htm, 25KB, updated: 2006/7/28 14:20