HTML Character Encoding

HTML Character Encoding

Different types of Character set

CharacterName of EntityHexadecimalEntity Decimal
space     
!!!!
" """
####
$$$$
%%%%
&& &&&
'''
((((
))))
** ***
++++
,,,-
−−-
....
///&#47
0  0
1  1
2  2
3  3
4  4
5  5
6  6
7  7
8  8
9  9
::::
;;&#x0003B;&#59;
<&lt; &LT;&#x0003C;&#60;
=&equals;&#x0003D;&#61;
>&gt; &GT;&#x0003E;&#62;
?&quest;&#x0003F;&#63;
@&commat;&#x00040;&#64;

Alphabets -Latin(CAPITAL LETTERS)

Capital CharacterName of EntityHexadecimalEntity Decimal
𝒜&Ascr;&#x1D49C;&#119964;
&Bscr;&#x0212C;&#8492;
𝒞&Cscr;&#x1D49E;&#119966;
𝒟&Dscr;&#x1D49F;&#119967;
&Escr;&#x02130;&#8496;
&Fscr;&#x02131;&#8497;
𝒢&Gscr;&#x1D4A2;&#119970;
&Hscr;&#x0210B;&#8459;
&Iscr;&#x02110;&#8464;
𝒥&Jscr;&#x1D4A5;&#119973;
𝒦&Kscr;&#x1D4A6;&#119974;
&Lscr;&#x02112;&#8466;
&Mscr;&#x02133;&#8499;
𝒩&Nscr;&#x1D4A9;&#119977;
𝒪&Oscr;&#x1D4AA;&#119978;
𝒫&Pscr;&#x1D4AB;&#119979;
𝒬&Qscr;&#x1D4AC;&#119980;
&Rscr;&#x0211B;&#8475;
𝒮&Sscr;&#x1D4AE;&#119982;
𝒯&Tscr;&#x1D4AF;&#119983;
𝒰&Uscr;&#x1D4B0;&#119984;
𝒱&Vscr;&#x1D4B1;&#119985;
𝒲&Wscr;&#x1D4B2;&#119986;
𝒳&Xscr;&#x1D4B3;&#119987;
𝒴&Yscr;&#x1D4B4;&#119988;
𝒵&Zscr;&#x1D4B5;&#119989;

Character Encoding From 91 to 96

CharacterName of EntityHexadecimalEntity Decimal
[&lsqb; &lbrack;&#x0005B;&#91;
\&bsol;&#x0005C;&#92;
]&rsqb; &rbrack;&#x0005D;&#93;
^&Hat;&#x0005E;&#94;
_&lowbar;&#x0005F;&#95;
`&grave; &DiacriticalGrave;&#x00060;&#96;

Alphabets -Latin(small letters)

Small AlphabetsName of EntityHexadecimalEntity Decimal
𝒶&ascr;&#x1D4B6;&#119990;
𝒷&bscr;&#x1D4B7;&#119991;
𝒸&cscr;&#x1D4B8;&#119992;
𝒹&dscr;&#x1D4B9;&#119993;
&escr;&#x0212F;&#8495;
𝒻&fscr;&#x1D4BB;&#119995;
&gscr;&#x0210A;&#8458;
𝒽&hscr;&#x1D4BD;&#119997;
𝒾&iscr;&#x1D4BE;&#119998;
𝒿&jscr;&#x1D4BF;&#119999;
𝓀&kscr;&#x1D4C0;&#120000;
𝓁&lscr;&#x1D4C1;&#120001;
𝓂&mscr;&#x1D4C2;&#120002;
𝓃&nscr;&#x1D4C3;&#120003;
&oscr;&#x02134;&#8500;
𝓅&pscr;&#x1D4C5;&#120005;
𝓆&qscr;&#x1D4C6;&#120006;
𝓇&rscr;&#x1D4C7;&#120007;
𝓈&sscr;&#x1D4C8;&#120008;
𝓉&tscr;&#x1D4C9;&#120009;
𝓊&uscr;&#x1D4CA;&#120010;
𝓋&vscr;&#x1D4CB;&#120011;
𝓌&wscr;&#x1D4CC;&#120012;
𝓍&xscr;&#x1D4CD;&#120013;
𝓎&yscr;&#x1D4CE;&#120014;
𝓏&zscr;&#x1D4CF;&#120015;

KEYBOARD CHARACTERS

Keyboard CharactersName of EntityHexadecimalEntity Decimal
˜&tilde; &DiacriticalTilde;&#x002DC;&#732;
`&grave; &DiacriticalGrave;&#x00060;&#96;
!&excl;&#x00021;&#33;
@&commat;&#x00040;&#64;
#&num;&#x00023;&#35;
$&dollar;&#x00024;&#36;
%&percnt;&#x00025;&#37;
^&Hat;&#x0005E;&#94;
&&amp; &AMP;&#x00026;&#38;
*&ast; &midast;&#x0002A;&#42;
(&lpar;&#x00028;&#40;
)&rpar;&#x00029;&#41;
_&lowbar;&#x0005F;&#95;
&minus;&#x2212;&#8722;
+&plus;&#x0002B;&#43;
=&equals;&#x0003D;&#61;
\&bsol;&#x0005C;&#92;
/&sol;&#x0002F;&#47
|&verbar; &vert; &VerticalLine;&#x0007C;&#124;
{&lcub; &lbrace;&#x0007B;&#123;
}&rcub; &rbrace;&#x0007D;&#125;
[&lsqb; &lbrack;&#x0005B;&#91;
]&rsqb; &rbrack;&#x0005D;&#93;
:&colon;&#x0003A;&#58;
;&semi;&#x0003B;&#59;
&quot; &QUOT;&#x00022;&#34;
&apos;&#x00027;&#39;
&rsquo; &rsquor; &CloseCurlyQuote;&#x02019;&#8217;
&lsquo; &OpenCurlyQuote;&#x02018;&#8216;
<&lt; &LT;&#x0003C;&#60;
>&gt; &GT;&#x0003E;&#62;
,&comma;&#x0002C;&#44;
.&period;&#x0002E;&#46;
?&quest;&#x0003F;&#63;
¦&brvbar;&#x000A6;&#166;
space&nbsp; &NonBreakingSpace;&#x000A0;&#160;
Indian  rupee sign&#x20B9;&#8377;
HTML Character Encoding Example

HTML Character Encoding

To control HTML Character Encoding ASCII uses from 0 to 31 (and 127) values

For digits, symbols and letters ASCII uses the values from 32 to 126

Values from 128 to 255 ASCII will not use.

Character set ANSI (Windows-1252)

Values from 0 to 127 ANSI is identical to ASCII.

Values from 160 to 155 ANSI is identical to UTF-8

Values from 128 to 159 ANSI have a proprietary set of characters.

Character set ISO-8859-1

Values from 0 to 127 8859-1 are identical to ASCII.

Values from 128 to 159 8859-1 will not use the values.

Values from 160 to 255 UTF-8 is identity in 8859-1

Character set UTF-8

For ASCII values 0 to 127 character set UTF-8 is identical.

Values from 128 to 159 character set UTF-8 will not use the values.

Values from 160 to 155 for both ANSI and 8859-1 character set UTF-8 are identical.

Interview Questions & Answer

Character set UTF-8

  • The ASCII values 0 to 127 character set UTF-8 is identical.
  • Values from 128 to 159 character set UTF-8 will not use the values.
  • Values from 160 to 155 for both ANSI and 8859-1 character set UTF-8 are identical.

Tutorials for all brains!